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00 . Abstract 
(N 

In this paper we study random graphs with independent and identically distributed degrees 
of which the tail of the distribution function is regularly varying with exponent t £ (2, 3). 

The number of edges between two arbitrary nodes, also called the graph distance or hopcount, 
in a graph with N nodes is investigated when N oo. When t £ (2,3), this graph distance 
1 ^ ' grows like 2 | }°||°^'^)| ■ In different papers, the cases r > 3 and r e (1,2) have been studied. 

■ We also study the fluctuations around these asymptotic means, and describe their distributions. 

The results presented here improve upon results of Reittu and Norros, who prove an upper 
bound only. 
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00 

^ ■ 1 Introduction 

' The study of complex networks plays an increasingly important role in science. Examples of complex 
(— I networks are electrical power grids and telephony networks, social relations, the World-Wide Web 
, and Internet, co-authorship and citation networks of scientists, etc. The structure of networks 
^ ■ affects their performance and function. For instance, the topology of social networks affects the 
^ ■ spread of information and infections. Measurements on complex networks have shown that many 
. ^ ' networks have similar properties. A first key example of such a fundamental network property is the 
^ fact that typical distances between nodes are small, which is called the 'small world' phenomenon. 
' A second key example shared by many networks is that the number of nodes with degree k falls 
off as an inverse power of k, which is called a power law degree sequence. See |1J 1291 136j and the 
references therein for an introduction to complex networks and many examples where the above 
two properties hold. 

The current paper presents a rigorous derivation for the random fluctuations of the graph 
distance between two arbitrary nodes (also called the geodesic, and in Internet called the hopcount) 
in a graph with infinite variance degrees. The model studied here is a variant of the configuration 
model. The infinite variance degrees include power laws with exponent r E (2,3). In practice, 
power exponents are observed ranging between r = 1.5 and r = 3.2 (see UHl)- 

In a previous paper of the first two authors with Van Mieghem [2^, we investigated the finite 
variance case r > 3. In we study the case where r G (1, 2). Apart from the critical cases r = 2 
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and r = 3, we have thus investigated all possible values of r. The paper serves as a survey to 
the results and, in particular, describes how our results can be applied to Internet data, describes 
related work on random graphs that are similar, though not identical to ours, and gives further 
open problems. Finally, in |23j . we also investigate the structure of the connected components 
in the random graphs under consideration. See ^ ^ |25j for an introduction to classical random 
graphs. 

This section is organised as follows. In Section ITTTI we start by introducing the model, in Section 
II. 21 we state our main results. Section [T31 is devoted to related work, and in Section [Ol we describe 
some simulations for a better understanding of the results. 

1.1 Model definition 

Fix an integer A^. Consider an i.i.d. sequence Di, D2, ■ ■ ■ , D^^. We will construct an undirected 
graph with N nodes where node j has degree Dj. We will assume that Lj^ = '}2f=i^j even. 
If Ljv is odd, then we increase Dpf is by 1. This change will make hardly any difference in what 
follows, and we will ignore this effect. We will later specify the distribution of Di. 

To construct the graph, we have N separate nodes and incident to node j, we have Dj stubs. 
All stubs need to be connected to another stub to build the graph. The stubs are numbered in 
an arbitrary order from 1 to Ljv- We start by connecting at random the first stub with one of the 
Lff — 1 remaining stubs. Once paired, two stubs form a single edge of the graph. We continue the 
procedure of randomly choosing and pairing the stubs until all stubs are connected. Unfortunately, 
nodes having self-loops may occur. However, self-loops are scarce when N ^ 00. 

The above model is a variant of the configuration model, which, given a degree sequence, is the 
random graph with that given degree sequence. For a graph, the degree sequence of that graph is 
the vectors of which the fc*^ coordinate equals the frequency of nodes with degree k. In our model, 
the degree sequence is very close to the distribution of the nodal degree D of which Di , . . . , -Djv are 
i.i.d. copies. The probability mass function and the distribution function of the nodal degree law 
are denoted by 

nDi=j)=fj, i = l,2,..., and F(a;)=5^/,-, (1.1) 

where [x\ is the largest integer smaller than or equal to x. Our main assumption is that we take 

l-i?(x) =x~^+^L(x), (1.2) 

where r S (2, 3) and L is slowly varying at infinity. This means that the random variables Di obey 
a power law, and the factor L is meant to generalize the model. We work under a slightly more 
restrictive assumption: 

Assumption 1.1 There exists 7 G [0, 1) and C > such that 

Comparing with (|1.2j) . we see that the slowly varying function L in (|1.2j) should satisfy 

g-C(iogx)^ < L(x) < e^(^°*5^)\ (1.4) 

1.2 Main results 

We define the graph distance Hpf between the nodes 1 and 2 as the minimum number of edges 
that form a path from 1 to 2. By convention, the distance equals cxd if 1 and 2 are not connected. 
Observe that the distance between two randomly chosen nodes is equal in distribution to H^f, 
because the nodes are exchangeable. We now describe our main result. 
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Theorem 1.2 (Fluctuations of the Graph Distance) Assume that Assumvtion M . 1\ holds and 
fix T € (2,3) in Then there exist random variables {Ra)ae{~ifi] such that, as N ^ oo, 



log log N 



+ 1 



<oo) =F{Ra^ =l)+o{l), l£Z, (1.5) 



L|log(r-2 

inhfTP n — I log log ^ I log log Af ^ ^ i n] 

Where - L|iog(r-2)|J ~ |iog(T-2)| ^ {-^^^\- 

In words, Theorem 11.21 states that for r E (2,3), the graph distance Hj^ between two ran- 
domly chosen connected nodes grows proportional to log log of the size of the graph, and that the 
fluctuations around this mean remain uniformly bounded in A^. 

We identify the laws of (^a)ae(-i,o] below. Before doing so, we state two consequences of the 
above theorem: 

Corollary 1.3 (Convergence in Distribution along Subsequences) Along the sequence Ni^ = 

[N^ J , where k = 1,2, . . . , and conditionally on 1 and 2 being connected, the random vari- 

ables 

^ log log Nk 

converge in distribution to Raj^^ , as k ^ oo 



log(r 



(1.6) 



Simulations illustrating the weak convergence in Corollarv 11.31 are discussed in Section [1.41 In 
the corollary below, we write that an event E occurs whp for the statement that ¥(E) = 1 — o(l). 

Corollary 1.4 (Concentration of the Graph Distance) 

(i) Conditionally on 1 and 2 being connected, the random variable Hj^ is, whp, in between 

(ii) Conditionally on 1 and 2 being connected, the random variables Hj^ — |°|(°^'^) form a tight 
sequence, i.e., 



H„<oo)=l. (1.7) 



/ 1 log log N I 

lim limsupP \Hr, - 2—^—^—-\ < K 

K^oo AT^oo V I log(r - 2)1 

We need a limit result from branching processes theory before we can identify the limiting 
random variables (i?a)ag(_i q] . In Section |2 we introduce a delayed branching process {Zk}k>i, 
where in the first generation the offspring distribution is chosen according to ()1.1() and in the second 
and further generations the offspring is chosen in accordance to g given by 

.. = ^^^^, . = 0,1,..., (1.8) 

where /x = ^^iJfj- The branching process {Zk} has infinite expectation. Branching processes 
with infinite expectation have been investigated in ^UHllinS]- Assumption 11.11 using the results 
in JH]) implies that 

(r - 2)" • log(Z„ V 1) ^ y, a.s., (1.9) 

where xV y = max{x,y}. See Section |21 and the references there for more details. Then, we can 
identify the law of the random variables {Ra)ae{-~ifi] ^ follows: 

Theorem 1.5 (The Limit Laws) For a G (—1,0], 

¥{Ra >l)= pfmin [(r - 2)~'Y^^'> + (r - 2)"-^'^^)] < (r - 2) r'/2l+a|y{i)y{2) ^ q 

where c/ = 1 if I is even, and zero otherwise, and y , y*^^ are two independent copies of the limit 
random variable in 
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In Remarks 14.11 and IA.1.51 below, we will explain that our results also apply to the usual 
configuration model, where the number of nodes with a given degree is fixed, when we study the 
graph distance between two uniformly chosen nodes, and the degree distribution satisfied certain 
conditions. For the precise conditions, see Remark IA.1.51 

I. 3 Related work 

There is a wealth of related work which we now summarize. The model investigated here was also 
studied in jSSj, with 1—F{x) = x~'^'^^ L{x), where r G (2, 3) and L denotes a slowly varying function. 
It was shown in that whp the graph distance is bounded from above by 2 ||°||°^'^ | (1 + o(l)). 
We improve the results in by deriving the asymptotic distribution of the random fiuctuations 
of the graph distance around 2[ ||°||°^'^^ |J. Note that these results are in contrast to jSOj Section 

II. F, below (56)], where it was suggested that if r < 3, then an exponential cut-off is necessary 
to make the graph distance between an arbitrary pair of nodes well-defined. The problem of the 
graph distance between an arbitrary pair of nodes was also studied non-rigorously in ^3]) where 
also the behavior when r = 3 and x L{x) is the constant function, is included. In the latter 
case, the graph distance scales like j^^^^j^- A related model to the one studied here can also be 
found in 31^, where a graph process is defined by adding and removing edges. In the authors 
prove similar results as in [S21 for this related model. 

The graph distance for r > 3, r G (1,2), respectively was treated in two previous publications 
|21j and |22], respectively. We survey these results together with results on the connected compo- 
nents in |2S|- In Uni, we also show that when r > 2, the diameter is bounded from below by a 
constant times logN, which, when r G (2,3) should be contrasted with the average graph distance, 
which is or order log log A^. Finally, in (23j also the connected components are studied under the 
condition that /i = IE[-Di] > 2, and the results in this paper are used to show that whp there 
exists a largest connected component of size qN[l + o(l)], where q is the survival probability of the 
delayed branching process, while all other connected components are of order at most log A^. 

There is substantial work on random graphs that are, although different from ours, still similar 
in spirit. In [J, random graphs were considered with a degree sequence that is precisely equal to 
a power law, meaning that the number of nodes with degree k is precisely proportional to k~'^ . 
Aiello et al. show that the largest connected component is of the order of the size of the graph 
when T < To = 3.47875 . . ., where tq is the solution of ({t — 2) — 2C(r — 1) = 0, and where C is the 
Riemann zeta function. When r > tq, the largest connected component is of smaller order than the 
size of the graph and more precise bounds are given for the largest connected component. When 
T £ (1,2), the graph is whp connected. The proofs of these facts use couplings with branching 
processes and strengthen previous results due to Molloy and Reed [23|2H!. For this same model, 
Dorogovtsev et al. |171 [T5ij investigate the leading asymptotics and the fiuctuations around the 
mean of the graph distance between arbitrary nodes from a theoretical physics point of view, using 
mainly generating functions. 

A second related model can be found in |121ll3j . where edges between nodes i and j are present 

with probability equal to WiWj / Ylii'^i some 'expected degree vector' w = {wi, . . . ,Wn). It is 

assumed that maxj tD? < J^i'^ii so that WiWj / Y^iWi are probabilities. In 12 , Wi is often taken as 

1 1 
Wi = ci , where c is a function of N proportional to A^-i . In this case, the degrees obey a power 

law with exponent r. Chung and Lu jTJj show that in this case, the graph distance between two 

uniformly chosen nodes is whp proportional to log A(H-o(l)) when r > 3, and 2 ||°||°^'^^| (l + o(l)) 

when r G (2, 3). The difference between this model and ours is that the nodes are not exchangeable 

in ^21, but the observed phenomena are similar. This result can be heuristically understood as 

follows. Firstly, the actual degree vector in |12, should be close to the expected degree vector. 

Secondly, for the expected degree vector, we can compute that the number of nodes for which the 
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Figure 1: Histograms of the AS-count and graph distance in the configuration model with N = 
10, 940, where the degrees have generating function /t(s) in (jLllj) . for which the power law exponent 
T takes the value r = 2.25. The AS-data is lightly shaded, the simulation is darkly shaded. 



degree is at least k equals 

\{i : Wi > k}\ = \{i : ci'^ > k}\ oc 

Thus, one expects that the number of nodes with degree at least k decreases as k~'^~^^, similarly 
as in our model. In jJHj, Chung and Lu study the sizes of the connected components in the above 
model. The advantage of this model is that the edges are independently present, which makes the 
resulting graph closer to a traditional random graph. 

All the models described above are static, i.e., the size of the graph is fixed, and we have 
not modeled the growth of the graph. There is a large body of work investigating dynamical 
models for complex networks, often in the context of the World-Wide Web. In various forms, 
preferential attachment has been shown to lead to power law degree sequences. Therefore, such 
models intend to explain the occurrence of power law degree sequences in random graphs. See 
121 01 IS m IHl ini ^3 ^l 1201 and the references therein. In the preferential attachment model, 
nodes with a fixed degree m are added sequentially. Their stubs are attached to a receiving node 
with a probability proportional to the degree of the receiving node, thus favoring nodes with large 
degrees. For this model, it is shown that the number of nodes with degree k decays proportionally 
to k~^ the diameter is of order when m > 2 |S|, and couplings to a classical random 

graph G{N,p) are given for an appropriately chosen p in TD]. See also for a survey. 

Possibly, the configuration model is a snapshot of the above models, i.e., a realization of the 
graph growth processes at the time instant that the graph has a certain prescribed size. Thus, 
rather than to describe the growth of the model, we investigate the properties of the model at a 
given time instant. This is suggested in ^ Section VII. D], and it would be very interesting indeed 
to investigate this further mathematically, i.e., to investigate the relation between the configuration 
and the preferential attachment models. 

We study the above version of the configuration model to describe the topology of the Internet 
at a fixed time instant. In a seminal paper ^M,, Faloutsos et al. have shown that the degree 
distribution in Internet follows a power law with exponent r ~ 2.16 — 2.25. Thus, the power law 
random graph with this value of r can possibly lead to a good Internet model. In PSI, and inspired 
by the observed power law degree sequence in the power law random graph is proposed as 
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a model for the network of autonomous systems. In this graph, the nodes are the autonomous 
systems in the Internet, i.e., the parts of the Internet controlled by a single party (such as a 
university, company or provider), and the edges represent the physical connections between the 
different autonomous systems. The work of Faloutsos et al. in (T9| was among others on this graph 
which at that time had size approximately 10,000. 

In it is argued on a qualitative basis that the power law random graph serves as a better 
model for the Internet topology than the currently used topology generators. Our results can 
be seen as a step towards the quantitative understanding of whether the AS-count in Internet is 
described well by the average graph distance in the configuration model. The AS-count gives the 
number of physical links connecting the various autonomous domains between two randomly chosen 
nodes in the graph. 

To validate the model, we compare a simulation of the distribution of the distance between pairs 
of nodes in the power law random graph with the same value of and r to extensive measurements 
of the AS-count in Internet. In Figure ^ we see that AS-count in the model with the predicted 
value of r = 2.25 and the value of N from the data set fits the data remarkably well. 

In |291 Table II], many other examples are given of real networks that have power law degree 
sequences. Interestingly, there are many examples where the power law exponent is in (2,3), and 
it would be of interest to compare the average graph distance between an arbitrary pair of nodes 
in such examples. 

I. 4 Demonstration of Corollary 11.31 

By a simulation we explain the relevance of Theorem 11.21 and especially the relevance of Corollary 

II. 31 We have chosen to simulate the distribution ()1.8|) from the generating function: 

gris) = 1 - (1 - sy-^ for which g, = (-l)^-i T ^) ~ j ^ ^- (1-10) 

Defining 

f-^'^ = V^' ' ^e(2,3), (1.11) 

it is immediate that 

For fixed r, we can pick different values of the size of the simulated graph, so that for each two 
simulated values N and M we have Ojv = Omj i-S-, 

AT = m(^-2) for some integer k. For r = 2.8, 

we have taken the values 

Af = 1,000, A^ = 5,623, A^ = 48,697, Ar = 723,394. 

According to Theorem II. 2( the survival functions of the hopcount Hj^, satisfying = M*^"^"^^ 
run parallel on distance 2 in the limit for A^ — > oo. In Section [21 below we will show that the 
distribution with generating function (|l.llj) satisfies Assumption ll.il 



1.5 Organization of the paper 

The paper is organized as follows. We first review the relevant literature on branching processes in 
Section El We then describe the growth of shortest path graphs in Section 13 and we state coupling 
results needed to prove our main results. Theorems I1.2H1.5I in Section 0J In Section |S1 we prove 
three technical lemmas used in Section 0] We finally prove the coupling results in the Appendix. 
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Figure 2: Empirical survival functions of the graph distance for r = 2.8 and for the four values of 
N. 



2 Review of branching process theory with infinite mean 

Since we heavily rely on the theory of branching processes (BP's), we now briefly review this theory 
in the case where the expected value (mean) of the offspring distribution is infinite. We follow in 
particular jl6j , and also refer the readers to related work in [331 134j , and the references therein. 

For the formal definition of the BP we define a double sequence {^n,i}n>o,i>i of i.i.d. ran- 
dom variables each with distribution equal to the offspring distribution {gj} given in (|1.8() with 

distribution function G{x) = Yl\=o9j- "^^^ i^n} is now defined hy Zq = 1 and 

Zn+l='^Xn,i, n > 0. 
i=l 

In case of a delayed BP, we let Xq^i have probability mass function {fj}, independently of {Xn,i}n>i,i>i- 
In this section we restrict to the non-delayed case for simplicity. 

We follow Davies in who gives the following sufficient conditions for convergence of 

(r — 2)" log(l -|- Zn). Davies' main theorem states that if for some non-negative, non-increasing 
function j{x): 

(i) < 1 - G(x) < 2;-"+t(^), for large X and < a < 1, 

(ii) x'^^^^ is non-decreasing, 

(iii) 7(e^") < oo or equivalently dy < oo, 

then a" log(l -|- Zn) converges almost surely to a non-degenerate finite random variable Y with 
¥{Y = 0) equal to the extinction probability of {Z^}, whereas Y admits a density on (0,oo). 
Therefore, also a" log{Zn V 1) converges to Y almost surely. 

The conditions of Davies quoted as (i-iii) simplify earlier work by Seneta j34j . For example, 
for {g} in (|l.inj) . the above is valid with a = r — 2 and 7(x) = C(logx)~^, where C is sufficiently 
large. We prove in Lemma lA.l.ll below that for F as in Assumption 11.11 and G the distribution 
function of g in (|1.8j) . the conditions (i-iii) are satisfied with a = t — 2 and 7(x) = C(log x)'''"^. In 
particular, for (iii), we need that 7 < 1. 
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Let and y'^' be two independent copies of the limit random variable Y. In the course of the 
proof, we will encounter the random variable M = mintg2(K*y^' + k'^~^Y'-'^^), for some c G {0, 1}, 
and where /t = (r — 2)~^. The proof relies on the fact that, conditionally on y(i)y(2) > g, M has 
a density. The proof of this fact is as follows. The function (yi,y2) ^ + K^~*y2) is 

discontinuous precisely in the points (1/1,1/2) satisfying \/y2/yi = n G Z, and, conditionally 

on y(i)y(2) > t]2e random variables V-^^ and y'^' are independent continuous random variables. 
Therefore, conditionally on yf^'yf^) > q, the random variable M = mintgz('^*^'^' + At'^^^y'^') has 
a density. 

3 The growth of the shortest path graph 

In this section, we describe the growth of the shortest path graph (SPG). As a result, we will see 
that this growth is closely related to a BP {Z^''^-'} with the random offspring distribution {dj^^} 
given by 

N 

= l|£)_j_,_x}lF'(a' stub from node i is sampled|Di, . . . , L>jv) 



where, for an event A, \a denotes the indicator function of the event A. By the strong law of large 
numbers for N ^ 00, almost surely, 

.... , 1^ 



N 



and -^l^^^^^^,^^P(D=j + l) 

i=l 



so that a.s. 



g)"' ^{j + l)nD = j + l)/nD]=gj, A^^oo. (3.2) 



Therefore, the BP {Zj}'"^^}, with offspring distribution {g'^^'j, is expected to be close to a BP with 
offspring distribution {gj} given in 1)1.8^ . Consequently, in Section IXTl we state bounds on the 
coupling of the BP {Z^^'^'} to a BP {Zj^^} with offspring distribution {gj}. This allows us to prove 
Theorems 11.21 and 11.51 in Section ^ 

The shortest path graph (SPG) from node 1 is the power law random graph as observed from 
node 1, and consists of the shortest paths between node 1 and all other nodes {2, . . . ,N}. As will 
be shown below, the SPG is not necessarily a tree because cycles may occur. Recall that two stubs 
together form an edge. We define Z'^'^^ = Di and, for k > 2, we denote by Z^''^^ the number 
of stubs attached to nodes at distance k — 1 from node 1, but are not part of an edge connected 
to a node at distance k — 2. We refer to such stubs as 'free stubs'. Thus, Zj}''^^ is the number of 
outgoing stubs from nodes at distance k — 1. By SPGfc_i we denote the SPG up to level k — 1, i.e., 
up to the moment we have Zj^''^^ free stubs attached to nodes on distance k — 1, and no stubs to 
nodes on distance k. Since we compare Zj^'^^ to the k^^ generation of the BP Zj^\ we call Zj^'^^ 
the stubs of level k. 

The first stages of a realization of the generation of the SPG, with N = 9 and Lpf = 24, are 
drawn in Figure Ol The first line shows the A^ different nodes with their attached stubs. Initially, 
all stubs have label 1. The growth process starts by choosing the first stub of node 1 whose stubs 
are labeled by 2 as illustrated in the second line, while all the other stubs maintain the label 1. 
Next, we uniformly choose a stub with label 1 or 2. In the example in line 3, this is the second 
stub from node 3, whose stubs are labeled by 2 and the second stub by label 3. The left hand side 
column visualizes the growth of the SPG by the attachment of stub 2 of node 3 to the first stub 
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Figure 3: Schematic drawing of the growth of the SPG from the node 1 with = 9 and the 
updating of the labels. 



of node 1. Once an edge is established the pairing stubs are labeled 3. In the next step, again a 
stub is chosen uniformly out of those with label 1 or 2. In the example in line 4, it is the first stub 
of the last node that will be attached to the second stub of node 1, the next in sequence to be 
paired. The last line exhibits the result of creating a cycle when the second stub of the last node 
is chosen to be attached to the first stub of node 3, which is the next stub in the sequence to be 
paired. This process is continued until there are no more stubs with labels 1 or 2. In this example, 
we have Z^'^^ = 3 and ^2^'^' = 6. 

We now describe the meaning of the labels. Initially, all stubs are labeled 1. At each stage of 
the growth of the SPG, we draw uniformly at random from all stubs with labels 1 and 2. After 
each draw we will update the realization of the SPG, and classify the stubs according to three 
categories, which will be labeled 1, 2 and 3. These labels will be updated as the growth of the SPG 
proceeds. At any stage of the generation of the SPG, the labels have the following meaning: 

1. Stubs with label 1 are stubs belonging to a node that is not yet attached to the SPG. 

2. Stubs with label 2 are attached to the shortest path graph (because the corresponding node 
has been chosen), but not yet paired with another stub. These are called 'free stubs'. 

3. Stubs with label 3 in the SPG are paired with another stub to form an edge in the SPG. 

The growth process as depicted in Figure 01 starts by labelling all stubs by 1. Then, because 
we construct the SPG starting from node 1 we relabel the Di stubs of node 1 with the label 2. We 
note that Z[^''^^ is equal to the number of stubs connected to node 1, and thus Z[^''^^ = Di. We 
next identify Zj^'^' for j > 1. Zj^''^' is obtained by sequentially growing the SPG from the free 
stubs in generation Zj^^'. When all free stubs in generation j — 1 have chosen their connecting 
stub, Zj^''^' is equal to the number of stubs labeled 2 (i.e., free stubs) attached to the SPG. Note 
that not necessarily each stub of Zj^^' contributes to stubs of Zj^''^\ because a cycle may 'swallow' 
two free stubs. This is the case when a stub with label 2 is chosen. 
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When a stub is chosen, we update the labels as follows: 



1. If the chosen stub has label 1, in the SPG we connect the present stub to the chosen stub to 
form an edge and attach the remaining stubs of the chosen node as children. We update the 
labels as follows. The present and chosen stub melt together to form an edge and both are 
assigned label 3. All 'brother' stubs (except for the chosen stub) belonging to the same node 
of the chosen stub receive label 2. 

2. In this case we choose a stub with label 2, which is already connected to the SPG. For the 
SPG, a self-loop is created if the chosen stub and present stub are 'brother' stubs which 
belong to the same node. If they are not 'brother' stubs, then a cycle is formed. Neither a 
self-loop nor a cycle changes the distances to the root in the SPG. 

The updating of the labels solely consists of changing the label of the present and the chosen 
stub from 2 to 3. 

The above process stops in the j^^ generation when there are no more free stubs in generation 
j - 1 for the SPG. 

We continue the above process of drawing stubs until there are no more stubs having label 1 or 
2, so that all stubs have label 3. Then, the SPG from node 1 is finalized, and we have generated 
the shortest path graph as seen from node 1. We have thus obtained the structure of the shortest 
path graph, and know how many nodes there are at a given distance from node 1. 

The above construction will be performed identically from node 2, and we denote the number 
of free stubs in the SPG of node 2 in generation k by Zj^''^\ This construction is close to being 
independent. In particular, it is possible to couple the two SPG growth processes with two inde- 
pendent bp's. This is described in detail in |^ Section 3]. We make essential use of the coupling 
between the SPG's and the BP's, in particular, of j2H Proposition A. 3.1] in the appendix. This 
completes the construction of the SPG's from both node 1 and 2. 

3.1 Bounds on the coupling 

We now investigate the growth of the SPG, and its relationship to the BP with law g. In its 
statement, we write, for i = 1,2, 

F^'.^) = (r - 2)" log(Z^''^' V 1) and F^^' = (r - 2)" log(Z^'' V 1), (3.3) 

where {^j^'}j>i and {^j^'}j>i are two independent delayed BP's with offspring distribution {g} 
and where -Zj'' has law {/}. Then the following proposition shows that the first levels of the SPG 
are close to those of the BP's: 

Proposition 3.1 (Coupling at fixed time) For every m fixed, and for i = 1,2, there exist in- 
dependent delayed BP's Z^^\Z'-^\ such that 

hm FiY^''^ = y^) = 1. (3.4) 

A'^oo 

In words. Proposition 13.11 states that at any fixed time, the SPG's from 1 and 2 can be coupled 
to two independent BP's with offspring g, in such a way that the probability that the SPG differs 
from the BP vanishes when N ^ oo. 

In the statement of the next proposition, we write, for i = 1, 2, 

rj;'-)=r^--'(e) = {k>m: {z^^^f"'"^ <n-^} 

= {k>m: k'^YJ;-''^ < log N}, (3.5) 

T — 1 
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where we recall that k = (r — 2) ^. 

We will see that Z^!''^^ grows super-exponentially with k as long as A; G Tm'^^ ■ More precisely, 

(Zm^')'' is close to Z^i!'^\ and thus, Tm'^'' can be thought of as the generations for which the 

generation size is bounded by N ^-'^ . The second main result of the coupling is the following 
proposition: 

Proposition 3.2 (Super-exponential growth with base Ym'^^ for large times) // F satis- 
fies Assumvtion M . 11 then for i = 1,2, 

(a) Pie<Y^'''^ <e-\ max |y,(^'^' - y^-'^'j > e^) = (3.6) 

(6) P(. < y^^'-) < e-\ 3k G T^'-\e) : Z^' > 4^'"') = o^M^), (3.7) 
ne < Y^''^ < e-\ 3k E r^-'^'(e) : Z^'''^' > iV^) = o,.,„,,(l), (3.8) 

where Ojv,m£(l) denotes a quantity 'yN,m,£ that converges to zero when first N 00, then m ^ 00 
and finally e | 0. 

Proposition 13.21 (a), i.e., (|3.H|) . is the main coupling result used in this paper, and says that as 
long as G Tm'^\e), we have that 1^*''^' is close to Ym''^\ which, in turn, by ProDosition l3.H is close 
to Ym . This establishes the coupling between the SPG and the BP. Part (b) is a technical result 
used in the proof. Equation ()3.7() is a convenient result, as it shows that, with high probability, 

k Zj!''^^ is monotone increasing. Equation (|3.8j) shows that with high probability Z^*'^' < N ^-1 
for all k E Tm''^\s), which allows us to bound the number of free stubs in generation sizes that are 
inT^''"'(e). 

We complete this section with a final coupling result, which shows that for the first k which is 
not in Tm''^\£), the SPG has many free stubs: 

Proposition 3.3 (Lower bound on Z^'_j_'^' for k + 1 ^ Tm'^\£)) Let F satisfy Assumption M . R 
Then, 

F{k G T^^'^^^e), k + 1^ T^-''\e),e < y^-'^' < e-\ Zl^:^ < iV^) = o^,™,,(l). (3.9) 

Propositions 13. H 13.21 and 13.31 will be proved in the appendix. We now prove the main results in 
Theorems 11.21 and 11.51 sub iect to Propositions 13.11 13.21 and 13.31 in Section 0J 

4 Proof of Theorems 11.21 and 11.51 

In this section we prove Theorem 11.21 and identify the limit in Theorem 11.51 using the coupling 
theory of the previous section. For i = 1,2, we recall that Zj''^' is the number of free stubs 
connected to nodes on distance j — 1 from root i. As we show in this section, the hopcount H^^ is 
closely related to the SPG's {Z^''''^}j>o,i = 1,2. 

4.1 Outline of the proof 

We start by describing the outline of the proof. The proof is divided into several key steps proved 
in 5 subsections. 

In the first key step of the proof, in SectionESl we split the probability F(i^jv > k) into separate 
parts depending on the values of Ym'^^ = (r — 2)™ log{Zm^^ V 1). We prove that 

F{Hr, > k, YJ^'^'^Y^'"'^ = 0) = l-ql + o(l), iV ^ 00, (4.1) 
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where 1 — is the probabihty that the delayed BP {Zj }j>i dies at or before the m generation. 
When m becomes large, then — > q, where q equals the survival probability of the BP {Zj^''}j>i. 
This leaves us to determine the contribution to P(i/]v > k) for the cases where ll^'^'ll^''" > 0. 
We further show that for m large enough, and on the event that Ym'^^ > 0, whp, Ym^^ € [e,e~^], 
for i = 1,2, where e > is small. This provides us with a priori bounds on the shortest path graph 
exploration processes {Zj'"^-'}. We denote the event where Ym^^ G [e, for i = 1,2, by £'m,jv(£)- 
The second key step in the proof, in Section r4.31 is to obtain an asymptotic formula for P({iifjv > 
k} n Em^N^^))- Indeed, we prove the existence of A = Ajv(A;) > such that 

H^>k}n EmA^)) = e[i£;„_^(,) exp I - A^i±i-^}] , (4.2) 

where the right-hand side is valid for any ki with < 2ki < k — 1, and where A = Ajv(fc) satisfies 
5 < < 4:k. It is even allowed that ki is random, as long as it is measurable w.r.t. {Zj ' '}^]^. 

Even though the estimate on Xpf is not sharp, it turns out that it gives us enough information to 
complete the proof. The bounds ^ < Ajv(A;) < 4k play a crucial role in the remainder of the proof. 

In the third key step, in Section [4.41 we show that, for k = kpf ^ oo, the main contribution of 
(|4.2)) stems from the term 



E 



E^Me) -P { - A min ^l±y^}] , (4.3) 

with = BN{£,kfi) defined in ()4.5U|) and is such that ki G BM{£-,ki^) precisely when A;i + 1 S 
Tm'^\£) and kj^ — ki € 7^^'^'(e). Thus, by Proposition 13.21 it implies that whp 

4';:i<N'^ and <iV^. 

In turn, these bounds allow us to use Proposition I3.2r a). 

In the fourth key step, in Section [4.51 we proceed by choosing 

^°g^°g^ +1 (44) 
log(r-2)|J+'' ^^-^^ 

and we show that with probability converging to 1 as e | 0, the results of the coupling in Proposition 
ESI apply, which imphes that Y^l'^l « Y^'""^ and Y^l;\ ^ Y^'^'K 

In the final key step, in Section [4.61 the minimum occurring in (|4.3|) . with the approximations 
'^kl+i ~ ^m'^^ and Y^^^'^^f^^ « Ym''^\ is analyzed. The main idea in this analysis is as follows. With 
the above approximations, the expression in (|4.3|) can be rewritten as 

e\1e^ ^(,) exp I - Aexp [ min y^+'Y^'''^ + k'^-'^Y^-'^^) - logL J |j + o^,^,,(l), (4.5) 

where k = (r — 2)~^ > 1. The minimum appearing in the exponent of 1)4. 5|1 is then rewritten (see 
and (jmH)) as 

Since kT'^^/^I — > 00, the latter expression only contributes to 1)4. 5|) when 

min(K*y(^'^' + K^'-*y('''^') - K-T'^^/^i 11^^ < 0. 

Here it will become apparent that the bounds on Xpfik) are sufficient. The expectation of the 
indicator of this event leads to the peculiar limit 



mm 



(K<y(i) +KCi-ty(2)) < ^a^~R/2l^y(i)y(2) > 0^ , 



with Ojv and q as defined in Theorem 11.21 We complete the proof by showing that conditioning on 
the event that 1 and 2 are connected is asymptotically equivalent to conditioning on ^ g_ 
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Remark 4.1 In the course of the proof, we will see that it is not necessary that the degrees of the 
nodes are i.i.d. In fact, in the proof below, we need that Propositions \S. lUS. .'^ are valid, as well as 
that Ljv is concentrated around its mean fiN. In Remark \A.l.^ in the appendix, we will investigate 
what is needed in the proof of Propositions \3. In particular, the proof applies also to some 

instances of the configuration model where the number of nodes with degree k is fixed, when we 
investigate the distance between two uniformly chosen nodes. 

We now go through the details of the proof. 
4.2 A priory bounds on Y^'^' 

We wish to compute the probability P(i?jv > A;). To do so, we split P(/^jv > k) as 

F{H^ >k)= F{H^ > k, YJ^-^'^YJ^-''^ = 0) + F{H^ > k, il^'^'y^''"" > 0), (4.6) 

where we take m to be sufficiently large. We will now prove two lemmas, and use these to compute 
the first term in the right-hand side of (|4.(ij) . 

Lemma 4.2 

lim nY^'-^Y^^-^ =0) = l-ql, 
JV— >oo 

where 

q^=F{Y^^>0). 

Proof. By Proposition for N oo, and because Ym^ and Ym^ are independent, 

P(y4^'-'y4^'-' = 0) = P(y^^^yr = o) + o(i) = i - fiy^'y^^ > o) + o(i) (4.7) 
= 1 - p(y« > o)p(yr > 0) + oil) = i-qi + o(i). 

□ 

The following lemma shows that the probability that H^f < m converges to zero for any fixed 

m: 

Lemma 4.3 For any m fixed, 

lim P(i7jv < m) = 0. 

Proof. As observed above Theorem 11.21 by exchangeability of the nodes {1, 2, . . . , A^}, 

P(i?jv <m)= P(Fjv < m), (4.8) 

where Hj^ is the hopcount between node 1 and a uniformly chosen node unequal to 1. We split, 
for any < 6 < 1, 

P(^iv <m)= F{Hj^ <m,Y^ Zf'''^ < N^) + ¥{H^ < m, ^ Zf'''^ > N^). (4.9) 

The number of nodes at distance at most m from node 1 is bounded from above by Ylj<m ^j^ '^^ ■ 
The event {i/jv < m} can only occur when the end node, which is uniformly chosen in {2, . . . , N}, 
is in the SPG of node 1, so that 

<m,Y, Z'r' < N') < = o(l), iV ^ oo. (4.10) 

j<m 



13 



Therefore, the first term in H4.9|) is o(l), as required. We will proceed with the second term in 
(|4.9)) . By Proposition 13.11 whp, we have that Yj^''^^ = Yj^'' for all j < m. Therefore, we obtain. 



because Yj^'"^^ = Yj^^ implies Zj^''^' = 



F{H^ < m, ^ Zj''^' > N^) < P( Zj''^' > N^) = Z'^^ > N^) + o(l). 

j<m i^Ti 

However, when m is fixed, the random variable ^j<„ Zj^^ is finite with probability 1, and therefore. 



lim F(Hj^ < m, V '"^^ > N^) = 0. (4.11) 

j<m 

This completes the proof of Lemma 14.31 □ 



We now use Lemmas 14.21 and 14.31 to compute the first term in (|4.6|) . We split 

FiH^ > k, y^^'^'^l''"' = 0) = P(y4^'">y4^'") = o) - f(//^ < k, y^^^'^y^^''^ = o). (4.12) 

By Lemma l4.2| the first term is equal to 1 — (7^ + o(l). For the second term, we note that when 
y^i.iv) _ Q g^j^j ^ then Hpj < m — 1, so that 

P(F^ < k, y^^-'^'y^^-'^) = 0) < P(//^ < m - 1). (4.13) 
Using Lemma l4.3| we conclude that 
Corollary 4.4 For every m fixed, and each A; G N, 



hm P(F^ > k, Y^'^'^Y^'''^ = 0) = 1 - gi 



By Corollary EH and (gSI), we are left to compute ^{Hj, > k^Y^'^'^Y^n'"'^ > 0). We first prove 
a lemma that shows that if 1^^'^' > 0, then whp 1^^'^^ G [e,e~^]: 

Lemma 4.5 For i = 1,2, 

lim sup lim sup lim sup P(0 < Yj^'^^ < £) = lim sup lim sup lim sup P(y^*'^' > e~'^) = 0. 

Proof. Fix m, when A*" ^ 00 it follows from Proposition 13. II that Ym'^^ = Ym , whp. Thus, we 
obtain that 

lim sup lim sup lim sup P(0 < y^'''^^ < e) = lim sup lim sup P(0 < Y^^ < e), 

e|0 m^oo Af— >oo e|0 m^oo 

and similarly for the second probability. The remainder of the proof of the lemma follows because 
Ym — > y'*' as m ^ 00 and is hence a tight sequence. □ 

Write 

Em,N = EmA^) = {Y^'""^ G [e,e'^],i = 1,2}, (4.14) 
Fm,N = FmA^) = { max lY^''""^ - y^^'^'l < e^^i = 1,2}. (4.15) 

As a consequence of Lemma 14.51 we obtain that 

FiE^^^^ n {y^^''^>y^^''^' > 0}) = o^,„,,(l). (4.16) 

In the sequel, we compute 

F{{H^>k}nEra,M), (4.17) 
and often we make use of the fact that by Proposition 13.21 

F{Em,N n F^A = o^,^,,(l). (4.18) 
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4.3 Asymptotics of ^{{H^ > k} n E,n,N) 

We next give a representation of P({i/jv > k} r\ Em,N)- In order to do so, we write Qz'^\ where 
hj > 0, for the conditional probabihty given {Zi^'^^jl^^ and {-^i^^^'j^^i (where, for j = 0, we 
condition only on {.^i^'^'}*^^), and E^'^* for its conditional expectation. Furthermore, we say that 
a random variable ki is Zm-measurable if ki is measurable with respect to the a-algebra generated 
by {Zi^'^'}^]^ and {Zi^'^'}^^- The main rewrite is now in the following lemma: 



Lemma 4.6 For k > 2m — 1, 

¥{{H^ >k}n Em,N) = E 



1e 



j,(m,m) 



(i^;v>2m-l)P^(fc,fci; 



(4.19) 



where, for any Zj^-measurable ki, with m < ki < {k — l)/2, 



2fei 



Pm(A;,A;i) = n Qr^J+^'r^/^l)(i7^ > > z - 1) 



(4.20) 



i=2m 



k-2ki 



Yl Qi''+'^''^+'\H^ > 2ki + i\H^ >2ki+i- 1). 



i=l 



Proof. We start by conditioning on {Zi^'''^}f^-^ and {Zf '"^^j^i, and note that E^, jv is measurable 
w.r.t. and {Zi^'^^jf^^, so that we obtain, for k>2m-\, 



= E 



(i^N > A:) 



(4.21) 



lE^^q^r'"'\H^ > 2m - l)Qr'"''(i?iv > A;|il^ > 2m - 1) 



{m,m) / 



Moreover, for i,j such that ? + j < fc, 



Qr'(i7^ > A;|i?^ >i + j-l) 
= Eg'^'' [Qg'^'+^H^iv > > i + i - 1)] 

= Eg'^' [Q^-^+^'(i/;v > i + jW^v > i + j - lW^'^'\H^ > k\H^ > i + j)], 



(4.22) 



and, similarly. 



{H^>k\H^>i + j-l) 



(4.23) 



(iJ;v > i + j\H^ >i + j- mT''^ {H^ > k\H^ > i + j)] . 



When we apply the above formulas, we can choose to increase i or j by one depending on {Zi^'^'}*^^ 
and {Zs'^''^''}l^^. We iterate the above recursions until i + j = k — 1. In particular, we obtain, for 
A; > 2m - 1, 



^(m,m) 



(i?^ > k\H,^ > 2m - 1) = E 



(m,m) 
Z 



jj{m+l,m) 



(iljv > 2m|ffjv > 2m - 1) (4.24) 



N(m+l,Tn) 



(iJ^ > k\Hj^ > 2m) 



so that, using that -Em,jv is Q^'™' -measurable and that E[E2 [X]] = E[X] for any random variable 
¥{{H^ > fe} n Em,N) (4.25) 



= E 



lE^,^QL"^''"'(^iv > 2m - l)Qr+^'"'\i?^ > 2m\H^ > 2m - l)Qr+^'"'\iJ^ > k\H^ > 2m) 
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We now compute the conditional probability by increasing i or j as follows. For i + j < 2ki, we 
will increase i and j in turn by 1, and for i + j > 2ki, we will only increase the second component 
j. This leads to 



jj(m,m) 



2ki 

{H^ > k\H^ > 2m - 1) = EL"'"^ [ J] Q<,LV2j+i.rv2i)(^^ > -j^^ > i - 1) 



(4.26) 



i=2m 
k-2ki 



n Q^^'-'^'-^'C^^ > 2ki + j\H^ > 2ki 



K-'"''[Pn.{k,ki)]. 



Here, we use that we can move the expectations E^'^' outside, as in (|4.25|) . so that these do not 
appear in the final formula. Therefore, from (|4.2H) and (|4.26|) . 



F{{H^ >k}n Em,, 



H^>2m-l)E^P"'^[Pmik,ki)] 



E™[ls„.^Qr™'(i^^ > 2m-l)Pmik,ki)] 



> 2m-l)Pm{k,ki) 



(4.27) 



This proves 

We note that we can omit the term 
term. Indeed, we can write 



r,(m,m) 



□ 

-fTjv > 2m — 1) in 1)4.19(1 by introducing a small error 



ft"\H^ > 2m - 1) = 1 - Q^t"\H^ < 2m - 1). (4.28) 



For the contribution to (|4.19|) due to the second term in (|4.28|) . we bound j^Pmik, ki) < 1. 
Therefore, the contribution to (|4.19() due to the second term in (|4.28p is bounded by 



E 



qi"''"'\Hj, < 2m - 1) = F{Hj^ < 2m - 1) = Ojv(l), 



(4.29) 



by Lemma 14.31 

We conclude that by (g^H), (|^rTH|) and 



'{{H^ >k}nE„ 



E 



(1), 



(4.30) 



where we recall (|4.2()|) for the conditional probability Pmik,ki) appearing in (j4.3()j) . 



We continue with (|4.30() by investigating the conditional probabilities in Pmi^, ^i) defined in (|4.2UI) . 



We have the following bounds for 



{H^ > i + j\H^ >i + j -I): 



Lemma 4.7 For all integers i,j > 0, 



exp 



A 7(1, iV) 7(2, iV) 

Lm 



< 



i/jv > i + j\HM > ^ + j — 1) < exp 



7(1, JV) 7(2. AT) 

2Liv 



The upper bound is always valid, the lower bound is valid whenever 



i+l 



(4.31) 



s=l 



s=l 
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Proof. We start with the upper bound. We fix two sets of rii and 712 stubs, and wih be interested 
in the probability that none of the rii stubs are connected to the n2 stubs. We order the ni stubs 
in an arbitrary way, and connect the stubs iteratively to other stubs. Note that we must connect at 
least stubs, since any stub that is being connected removes at most 2 stubs from the total 

of ni stubs. The number ni/2 is reached for ni even precisely when all the ni stubs are connected 
with each other. Therefore, we obtain that the probability that the ni stubs are not connected to 
the n2 stubs is bounded from above by 

["1/21 

n 1-^^?-Tt)- (4.32) 



1=1 

To complete the upper bound, we note that 



Ljv - 2i + 1 



no no 

JvjY -|- i JvjY 

to obtain that the probability that the ni stubs are not connected to the 77-2 stubs is bounded from 
above by 

e < e . (4.34) 

Applying the above bound to ni = Z'^^'^'' and n2 = Zj^''^\ and noting that the probability that 
i^jv > i + j given that Hj^ > i + j — 1 is bounded from above by the probability that the stubs in 
^i+f' are not connected to the stubs in Zj^'^^ leads to 

- '""^^j \ , (4.35) 

which completes the proof of the upper bound. 

We again fix two sets of ni and n2 stubs, and are again interested in the probability that none 
of the rzi stubs are connected to the n2 stubs. However, now we use these bounds repeatedly, and 
we assume that in each step there remain to be at least M stubs available. We order the ni stubs 
in an arbitrary way, and connect the stubs iteratively to other stubs. We obtain a lower bound by 
further requiring that the ni stubs do not connect to each other. Therefore, the probability that 
the ni stubs are not connected to the 71-2 stubs is bounded below by 



n 



n2 



M -2i + l 



(4.36) 



When M - 2ni > ^,we obtain that 1 - > 1 - x^- Moreover, when x < i, we have that 

1 — X > e~^^. Therefore, we obtain that when M — 2ni > and n2 < then the probability 
that the ni stubs are not connected to the n2 stubs when there are still at least M stubs available 
is bounded below by 



ni 



i=l i=l 

The event Hj^ > i + j conditionally on H^^ > i + j — 1 precisely occurs when the stubs 
^i+i^ are not connected to the stubs in Zj^''^\ We will assume that 1)4. 31(1 holds. We have that 

kf = L^-2nt\4^' 
happens precisely when 



M = -2Zit\ Z's''""' - 2Ei=i4'''"', and m = n2 = Zf'^'K Thus, M - 2ni > ^ 



M - 2ni > - 2 ^ _ 2 ^ Zf > — . (4.38) 



=1 s=l 
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This follows from the assumed bound in H4.31() . Also, when n2 = Z^^''^\ n2 < is implied by 
(|4.31|1 . Thus, we are allowed to use the bound in H4.37() . This leads to 

Qr^'^H^^ > i + j\HN > i + j - 1) > exp I ^i-^ 1, (4.39) 

which completes the proof of Lemma 14.71 □ 
4.4 The main contribution to F{{Hpf > A;} fl Em,N) 

We rewrite the expression in ()4.3U|) in a more convenient form, using Lemma 14.71 We derive an 
upper and a lower bound. For the upper bound, we bound all terms appearing on the right hand 
side of (ICTl) by 1, except for the term Qlj*=i+'''="*i'(ffjv > k\Hr, > k - 1), which arises when 
i = k — 2ki. Using the upper bound in Lemma 14.71 we thus obtain that 

Pmik,ki) < exp{ - ^i±l_^|. (4.40) 

The latter inequality is true for any Z^-measurable ki with m < ki < (k — l)/2. 
To derive the lower bound, we next assume that 

fci+l k—ki J 

Y.Z^r'+Y^Zr^K'f, (4.41) 

s=l s=l 

so that (jOTj) is satisfied for all i in (|On|) . We write, recalling 

B^^\e,k) = ^m<l<{k-l)/2: / + 1 G Z^^-^'(e), - Z € T^^'^^He)}- (4-42) 

We restrict ourselves to ki G B^^\e,k), if B^^\e,k) / 0. When ki G B''^\e,k), we are allowed 
to use the bounds in Proposition 13.21 Note that {ki G B%\e,k)} is Z^— measurable. Moreover, it 
follows from Proposition 13.21 that if ki G i3jv'(e,/c), that then, with probability converging to 1 as 
first N ^ CO and then m oo, 

Zf-'^)<iV^, Vm<s<A:i + l, while Zf'^' < iV^, 'inKsKk-ki. (4.43) 

Therefore, when ki G B^^\e,k)^ the assumption in (|4.41|) is satisfied with probability 1 — Ojv,m(l)) 
as long as k = O(A^^-i). The latter restriction is not serious, as we always have k in mind for 
which k = O (log log A'^) (see e.g. Theorem ll.2|) . 

Thus, on the event Em,N H {ki G B^^\e, k)}, using (|3.7() in Proposition 13 . 21 and the lower bound 
in Lemma W7j\ with probability 1 — OM,m,e{^)-, and for all i G {2m, . . . , 2/c — 1}, 

Qav2j+i.rv2i)(^^ > i\H^ > i - 1) > exp { Li/2j+i rV2l | > | _ fc-fci }^ (4.44) 

and, for \ < i < k — 2ki, 

^Ck,+iM+^)^jj^ > 2ki + i\H^ >2ki+i-l)>exp{ fci±LJ^} > exp { fci±L^^|_ 

(4.45) 

Therefore, by Lemma 14.61 and using the above bounds for each of the in total k terms, we 
obtain that when ki G ^^^(e, k) 7^ 0, and with probability 1 — Oiv,m,e(l)) 

Pm{k,ki) > exp{ - 4A;^i±p^}. (4.46) 
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We next use the symmetry for the nodes 1 and 2. Denote 

B'-^\e,k) = {m<l<ik- l)/2 : / + 1 G T^^'-'(e), k-lG U'^'^Ke)}. (4.47) 

Take / = — / — 1, so that (k — 1) /2 < I < k — 1 — m, and thus 

B''^\e,k) = \^{k-l)/2<l<k-l-m: [+ 1 G T^^-'^)(e), A; - [ G T^^-'^'(e)}. (4.48) 

Then, since the nodes 1 and 2 are exchangeable, we obtain from (|4.46)) . when ki G B^^\e,k) ^ 0, 
and with probability 1 — Ojv,m,e(l), 

Pm{k,ki) > exp{-4fc (4.49) 

Lin 

We define B^ie, k) = ^^'(6, A;) U B^^\e, k), which is equal to 

Br,ie,k) = {m</<A:-l-m: / + lG r^'''^>(e), k-l£ r^^-'^'(e)}. (4.50) 

We can summarize the obtained results by writing that with probability 1 — Oi^^m. ,^(1), and when 
Bff{£, k) ^ 0, we have 

Pmik,k,)=exp{-XN "'+1 (4.51) 
for all ki G BN{e,k), where = Ajv(A;) satisfies 

^ < Ajv(A;) < 4:k. (4.52) 

Relation ()4.51|) is true for any ki G Bff{£,k). However, our coupling fails when -^^.^^'^j or Z^^^^ 

grows too large, since we can only couple Zj''^' with Zj""^^ up to the point where Zj''^' < N ^-i . 
Therefore, we next take the maximal value over ki G Bi^{e,k) to arrive at the fact that, with 
probability 1 — Ojv,m,e(l), on the event that B^i^jk) / 0, 

Pm{k, ki) = max exp | — Ajv '^^^^ — k^±i_~\ _ \ _ \^ ^[j^ — fe:^ I (4.53) 

We conclude that 

HN>k}n Err,,N n {S^(e, /c) / 0}) (4.54) 

7(1, JV) 7(2, JV) 

lEn.,iv exp <^ - Ajv mm \ +o^,^,e(l). 



E 



From here as in 1)4.41) with / a fixed integer. 

In Section |51 we prove the following lemma that shows that, apart from an event of probability 

1 — 0]v,m,e(l)) we may assume that Bm[s-, kff) 7^ 0: 

Lemma 4.8 For all I, with k^ as in j^.^l ), 

limsuplimsuplimsupP({i?jv > ^iv} ^Em,N H {;Sjv(e, fejv) = 0}) = 0. 

£j,0 m^oo N^oo 

From now on, we will abbreviate Bf^ = Bpf{£, fcjv)- Using 1)4.54(1 and Lemma 14.81 we conclude 
Corollary 4.9 For all I, with kj^ as in i4-4\ )i 

F{{Hn > k^} n E^^r,) = ^ ^EmN exp \-Xm min + o^,™,,(l). 
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4.5 Application of the coupling results 

In this section, we use the coupling results in Section 13.11 Before doing so, we investigate the 
minimum of the function t K^yi + K^~^y2, where the minimum is taken over the discrete set 
{0, 1, . . . ,n}, and we recall that k = (r — 2)""'^. 

Lemma 4.10 Suppose that yi > y2 > 0, and k = (r — 2)^^ > 1. For the integer n > — 

t = argmmig|o,i,...,n} ('^ yi + ?/2) = round I - + ^logK )' 
where round{x) is x rounded off to the nearest integer. In particular, 

max < — — , — — > < 

[ * y2 yi J 

Proof. Consider, for real valued t E [0,n], the function 
Then, 

= - K"-*y2) log K, rit) = {K'yi + K^-'y^) log^ k. 

In particular, > that the function ^ is strictly convex. The unique minimum of is 

attained at t, satisfying ip'{t) = 0, i.e., 

2 2 log K, 

because n > — log(y2/yi)/logK. By convexity t* = \ i\ or t* = \t\. We will show that |t* — t| < |. 
Put = [i\ and = \t\ . We have 

= K""*y2 = K^Vmrn- (4.55) 
Writing t* = t + t* - i, we obtain for z = 1, 2, 

For < < 1, the function x + is increasing so < ^'(^2) ^'^cl only if t — < t\ — t^ 

OT i — t\ < ^, i.e., if ip{tl) < "0(^2) ^'^d hence the minimum over the discrete set {0, 1, . . . ,n} is 
attained at t^, then i — t\<^. On the other hand, if V'(^2) — then by the 'only if statement 

we find t2 — i < ^. In both cases we have |t* — £| < ^. Finally, if t* = tj, then we obtain, using 

1 < — Tir^ = = K ^ 1^ < K, 

yi K*^-* 

while for t* = t%, we obtain 1 < !;* < k. □ 

We continue with our investigation of P({//jv > fcjv} H Em,N) ■ We start from Corollarv 14.91 substi- 
tuting (lOl) . 

P({/7^ > fc^} n (4.56) 



E 



1^^,, exp { - A^exp [^min^ (K^^^+^n'^'i + ^'""'^^'^-1,) " ^^gL^] } 



+ Ojv,m,e(l7) 
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where we rewrite, using (|4.5U|) and H3.5|) . 



Br, = {m < ki < kr, - 1 - m : k'^^+^YJ^'"'^ < 



T - 1 



T-1 



logiV}. (4.57) 



Moreover, according to Proposition 13.21 and with probabiUty at least 1 — Ojv,m,e(l)) we have that 
minfc.eB^ {k^'^+^Y^I'+I + '^''''''''Y^'^\ ) is between 



min U^'+\YJ;^'''^ - e^) + K^^-^^iY^ 



and 



min (k^'+\Y^^^'''^ + e^) + k''^'^' {Y^''''> + e^)). 



To abbreviate the notation, we will write, for i = 1,2, 

_ Y(i,N) I 3 T^{»,iV) _ Y(i,N) 

m,+ ' 1 ^m,— 

Define for e > 0, 



(4.58) 



Hm,N — Hjyi,n{^) 



mm ( K 

0<fci<fcjv-i 



+ K^^-'^iy^'-^') < (1 + e') logiV !> . 



On the complement H^^j^, the minimum over < ki < kj^ — 1 of k^'^ Y^ '_ + k Y^'_ exceeds 
(1 + e^) log A^. Therefore, also the minimum over the set of k'^'^'^^YJ:^'^^ + k'^^^'^^Y^'^^ exceeds 
(1 +£^)logN, so that, using Lemma 14.71 and with error at most Ojv,m,£(l)) 

{-\ 

{-^exp(( 



< E 

< E 

< E 



exp 



exp 
exp 



min (Ac'^i+iy'^f ' + K'^N-k^yi^.N) ^ _ J ^ 



min {.'^^'Y^:^^ + ."--'^^Y^:^^) 
LfeieBiv 



log 



1 + e^)logiV-logL, 



')} 



< e 2, 



(4.59) 



because L^v < cA^, whp, as ^ oo. Therefore, in the remainder of the proof, we assume that 
Hm,N holds. 

We next show that whp , 



mm K 



irn 4- ) = mni I 

' 0<ki<kN 



ki+lv(i,N) ^kM-kiv{2,N) 



Y^(2,iV)^ 
m,+ / ' 



(4.60) 



and 



min U'^^+^Y^'^' + K'^^-'^iy'f'!") = min U'^^+^Y^'^^ + Ac'^'^-'^iy'^'^^ . 



(4.61) 



We start with (|4.60() . the proof of ()4.61|) is similar, and, in fact, slightly simpler, and is therefore 
omitted. To prove (|4.6U|) . we use Lemma [4.1U1 with n = k^j + 1, t = ki + 1, yi = Y^ 



and 



2/2 = 1; 



(2,JV) 



Let 



argmm 



and assume (without restriction) that k yi > k" y2- We have to show that t* — 1 £ B 
According to Lemma l4.1()( 



1 < 



K--'*y2 



< K. 



(4.62) 
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We define x = k^*Y^'^^ and y = *'Y^'^', so that x > y. By definition, on Hm,N{£), 



k''Y^:^^ + < (1 + e^) log A^. 



Since, on Em,N, we have that Ym' > £ 

Y 



e — e 



Y^ 

3 m 



1-e 



-Y 

2 m,— ' 



(4.63) 



and likewise for Y^'^ . Therefore, we obtain that on Em,N H Fm,N H Hm,N, and with e sufficiently 
small, 



1 + e' 



2\2 



X + y < ^ [k* y^^;.-^ + < log iV < (l + .) log iV. 

Moreover, by H4.62() . we have that 



1 < - < K. 

y 



Hence, on E^.n H F^^n H H^^n, we have 



x + y 



1 



when e > is sufficiently small. We claim that if 



log N = log A^, 

T — 1 



x = ^**il^;f <^iogiV, 

then A;* = — 1 G Bm{s-, k^), so that (|4.6Up follows. Indeed, we use (|4.67p to see that 

^'=*+iya,iv) ^ ^t*^a,iv) < ^*-yj,^;f < i^logiV, 

so that the first bound in (|4.57j) is satisfied. The second bound is satisfied, since 

1-e 



^fc^~fc*y^2,iV) ^ ^n-t*y^2,iV) < ^n-** = y < x < log iV, 



' m 



m 



(4.64) 
(4.65) 

(4.66) 
(4.67) 
(4.68) 
(4.69) 



where we have used n = kpf + 1, and 1)4. 67(1 . 

Thus, in order to show that ()4.6U|) holds with probability close to 1, we have to show that the 
probability of the intersection of the events {H^f > kpf} and 



SmA^) = <j 3t : i-i log AT < K*y^,^;^' < i±i 



logiV, 



(4.70) 



cVj;;^ + K"-*y^^;f < (1 + e) log iv} , 



can be made arbitrarily small by choosing e close to 0, when first N ^ oo and then m — > oo. That 
is the content of the following lemma, whose proof is deferred to Section \^ 

Lemma 4.11 For kj^ as in j^.^l ), 

limsuplimsuplimsupP(£'m,jv(e) niSm,jv(e) H {Hpf > k^f}) = 0. 

ejO TO— >oo N—fOD 



Therefore, we finally arrive at 



< E 



l^^m iv exp <^ - Ajv exp 



min (K^^+iy^^f + ) 

0<fci<fcjv ' ' 



logLj, 



(4.71) 

+ Ojv,m,£(l)i 



and at a similar lower bound where Y^'^^ is replaced by 1^''^' 



' m,- 
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4.6 Evaluating the limit 



The final argument consists of letting N ^ oo and then m — > oo. The argument has to be performed 
with Y^'^^ and Y^'^^ separately, after which we let e | 0. Since the precise value of e plays no role 
in the derivation, we only give the derivation for e = 0. Observe that 



min {k^^+'Y^-^^ 

0<ki<kM 



logLf 



0<k 



.rfciv/21 



rin ( K'=i+l~rfciv/2lya,iV) + ^Lfeiv/2j-fciy^2.iV) _ ^-\k^/2] 
i<kff \ ) 

+ K-^'-V^^'^' - logL^), (4.72) 



mm 



-rfc]v/2]+l<t<[fciv/2j+l 



where t = /ci + 1 — [A;jv/2], c^^ = q = [//2j — [//2] + 1 = jg gyen}' further rewrite, using 
(|4.4I) and the definition of a^^ in Theorem ll.21 



^ log iv log iv ■ 

From Lemma I4.1U1 for N ^ oo and on the event Em^N, 

mm (^*y^^--)+K^-V^^'-')=min(^V^^-'+^ 
-rfcjv/2]+l<t<[fcjv/2j tez 



because Ym'^^ G [e,e ^] on Em,N- We define 



and, for e > 0, 

Fjv = Fjv(/,e) = {Tym,iv(A:jv) > e}, 
Observe that 



1^ <Kr^^/2i, 



We split 



P({/?^ > k^} n = E [l^;^,^ exp [ - A^e^^'^^'^^™.'-('=^)]1 + o^,™,e(l) 



where we define 



In 
Jn 



E 
E 
E 



exp[-A.e'^^^-^^^--(^-)]l,^,,_- 
(exp[-A.e'^^^-^^^^"^-('=-)]-l)l5^,,„^^^ 
exp[-A.e'^^^----(^-)]l,.,,.,,^^^; 



(4.73) 



(4.74) 



(4.75) 



Gr, = G^{l,e) = {W^,r,{kM)<-e}. (4.76) 



(4.77) 



(4.78) 

(4.79) 
(4.80) 
(4.81) 



We first show that /jv, Jn and Kf^ are error terms, and then prove convergence of P(Gjv n Em,N) 
We start by bounding Ijv- By the first bound in ()4.77p . for every e > 0, and since Ajv > ^, 



limsup/jv < limsupexp { — ^ expf/t'^'^'^/^^e}} = 0. 



(4.82) 
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Similarly, by the second bound in H4.77() . for every e > 0, and since Ajv ^ 4A;jV) we can bound iT/v cis 



lim sup I Jjv I < lim sup E 



1 -exp{ -4/c^rexp{-K^'=^/2^e}} =0. (4.83) 



Finally, we bound Kpf by 

Kj,<F{F'^nG%nE^^j,). (4.84) 

We will next show that 

P(Gjv n^„,jv) = P(mm(K*y(i> + K^'-*y(2>) - K-'^^-m] < o,y(i)y(2) > g) +o^,^,e(i), (4.85) 

and 

P(F^ n n ^„,^) = Or,,m,sil)- (4.86) 
Equation ()4.86p is the content of the following lemma, whose proof is deferred to Section [21 
Lemma 4.12 For all I, 

limsuplimsuplimsupP(Fjv(Z, ey n Gn{1, n Em,N{£)) = 0. 

elO m— >oo AT— ►oo 

We now prove (|4.85j) . From the definition of Gm^ 

G^nE^,^ = {mm(K*ya.-) + _^-a.-rV2ll^ < _,^Y^,N, ^ (4.37) 

By Proposition 13. II and the fact that Lpf = fJ,N{l + o(l)) with probability 1 — Ojv(l), 

F{G^ n Em,N) = P(mm(K*y^i> + k'^~'Y^^) - k^^^-^'/^'^ < -e,yW g [e,^-!]) + o^(l). (4.88) 

Since converges to y^'' almost surely, sup3>^ lyj'' — y''^| converges to a.s. as m — > 00. 
Therefore, 

F{G^nEm,N) = p[Mi - K-'^N-Hm < ^ + o^,^,,(l), (4.89) 
where we define 

Ml = min(K*y(^) + k^'-*^)). (4.90) 
Moreover, since Y'-^'' has a density on (0, co) and an atom at (see jl6j). 

P(y<^)0[e,e-i],y«>O) = o,(l), 

so that, in turn, 

F{G^ n S^,^) = f(^Mi - K-'^iv-ri/2l < y(i)y(2) > 0) + o^,^,,(l) 

= q^F(^Mi - K-'^^-r;/2l < > q) + o^.,^,e(l)• (4.91) 

Recall from Section[21that for any / fixed, conditionally on yf^^y'^) > 0, the random variable M/ 
has a density. We denote this density by /2 and the distribution function by F2. Also, k^^-n-IW^] g 
= [K-rV2l^^-R/2l+i]. Then, 

-e<Mi- K-'^^-rV2l < 0) < sup[F2(a) - ^2(0 - e)]. (4.92) 

The function F2 is continuous on //, so that in fact F2 is uniformly continuous on //, and we 
conclude that 

lim sup sup [F2 (a) - ^2(0 - s)] = 0. (4.93) 
We conclude the results of Sections l4.2H4.6l in the following corollary: 
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Corollary 4.13 For all I, with as in J^.^j j, 

¥{{Hj, > kj,} n {yWy(2) > O}) = - ^-'^^-^^21 < q y(i)y(2) > 0) + o^.,™,e(l). 



We now come to the conclusion of the proofs of Theorems 11.21 and 11.51 We combine the results in 
Corollaries 14.41 and 



kf 



log log A'" 



4.131 together with the fact that q-m = Q + o{l) as m ^ oo, to obtain that, with 

loi(r-2)| 

> kr,N + 1) = l-q'^ + q^F [Mi - k-^^-^'/^I < g 



y(i)y(2) >0 +o^,„,,,(l). (4.94) 



Therefore, 



'{H^ < kr,M + l)= q^¥ (j^^i^i > K""^"^'/'^ |y(i)y(2) > oj + o^,™,e(l). (4.95) 



When Z — > oo, we claim that (|4.95j) implies that, when iV — > oo, 

¥{Hj^ < oo) =q^ + o{l). 



(4.96) 



Indeed, to see (|4.96p . we prove upper and lower bounds. For the lower bound, we use that for any 
I e Z 

F{Hn < oo) >F {Hj, < kr,N + I) , 

and let / — > oo in 1)4. 95() . together with the fact that k""^"^'/^^ — > as / ^ oo. For the upper 
bound, we split 

P (//^ < oo) = P {{H^ < oo} n {Yj^'^'^Y^'''^ = 0}) + P {{H^ < oo} n {Y^'^'^Y^'"'^ > 0}) . 

The first term is bounded by P(-ffjv < m — 1) = Ojv(l), by Lemma 14.31 The second term is bounded 
from above by, using Proposition 13.11 



P {{H^ < oo} n \Y^-^'n^-^^ > 0}) < P {Y^^'^^Y^'^^ >0)=ql + o,(l), 



(4.97) 



which converges to as m ^ oo. This proves (|4.96|) . We conclude from (|4.95|) and (|4.96|) that 



Hn < oo 



(M/ > (t - 2)'^^ + r;/2l |y(i)y{2) > + o(l) 



This completes the proofs of Theorems 11.21 and 11.51 



(4.98) 

□ 



5 Proofs of Lemmas I4.8L 14.111 and 14.121 

In this section, we prove the three lemmas used in Section 0] The proofs are similar in nature. 
Denote 

{A; G dTi^n = {ke Ti^n n{k + l^ T^'}. (5.1) 
We will make essential use of the following consequence of Propositions 13. ll and 13.21 

Lemma 5.1 Assume that Propositions \S. l\ and \S. ^ hold. Then, for any u > 0, and i = 1,2, 

^{{k e Tj;'^)} n < y^^'^' < e-^} n g ^ o^,™,,(i). (5.2) 

Consequently, 

¥{{k G ar^-^'} n {£ < y^-^' < e-^} n {z^^-'"' < n^^)"-'}) = o^,™,e(i). (5.3) 
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Proof. By Proposition 13.21 whp 

n*''"^<il^'"^+e^<i;<''"^(l + e'), (5.4) 
where the last inequaUty follows from Ym'^^ < s~^. Therefore, also 

y^'-'^) >y^!^'^'(l-2e2), (5.5) 

when e is so small that (l + e^)^"'^>l — 2e^. In a similar way, we conclude that 

Y^'"^ <Y^'^''\l + 2e^). (5.6) 
Furthermore, the event Zj^'^^ G [jV"(i-£)^ JV"^-'^^^)] is equivalent to 

(1 - e)uK-'' log N < y^*"-'^' < (1 + e)uK-'' log N. (5.7) 
Therefore, we obtain that, with log N, 

Y^'''^ < il + 2£^){l + e)uK"HogN <{l + 2e)uk,N- (5.8) 

Similarly, we obtain 

Y^'""^ > {l-2e'^){l-e)uK-''logN> {l-2e)uk,N- (5.9) 
We conclude that the events k £ ^^•''\e < Yjh''''' < e"^ and Z^''"^' G [jyHi-e) ^ jyu{i+e)^ -^^ply 

y^'-'^' Gnfc,^.[l-2e,l + 2e] = [nfe,^(l - 2e), nfc,^(l + 2e)]. (5.10) 
Since e < Y^^ < e~^, we therefore must also have (when e is so small that 1 — 2e > i), 

Uk^N^i^,^]- (5.11) 

Therefore, 

limsuplimsuplimsupP({fc £ T^^''^} n{e< y^^'^' < e'^ n {Z^'"'^' E [iV"(i-^), Ar"(i+^)]}) (5.12) 
< limsuplimsuplimsup sup P(y^'''^^ G x[l — 2e, 1 + 2e]) . 



ejO m-^oo Af^oo a;G[f,f] 

Since l^''^' = 1^'' whp by Proposition 13.11 we arrive at 
limsuplimsuplimsupP({A: G T^'''^ D {e < Y^'''^ < e'^} D {Z^'"'^' G [iV«(i-^), Ar«(i+^)]}) (5.13) 

ej.0 m— >cxD N^oo 

< limsuplimsup sup P(yW Ga;[l-2e,l + 2e]). 

We next use that Ym converges to y*'^ almost surely as m — > 00 to arrive at 
limsuplimsuplimsupP({A; G T^'''^ D {e < Y^'''^ < e'^} D {Zj!'''^ G [iV«(i-=), Ar«(i+^)]}) (5.14) 

£j,0 m^oo N^oo 

< limsup sup P(y'*> G x[l - 2e, 1 + 2e]) < limsupsup[Fi(a;(l + 2e)) - Fi(x(l - 2e))], 

where Fi denotes the distribution function of Y'-^\ which is continuous for x > 0. Moreover, 

lim 1-Fi{x) = 0. (5.15) 
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Therefore, uniformly in e < 1/4, 



sup[Fi{x{l + 2e)) - Fi{x{l - 2e))] < 2 sup [1 - - 2e))] ^0, {K ^ oo), (5.16) 

x>K x>K 



SO that 



sup[Fi(x(l + 2e)) - Fi{x{l - 2e))] (5.17) 

a:>0 

< sup [Fi(x(l + 2e))-Fi(x(l-2e))] + sup[Fi(x(l + 2e))-Fi(x(l-2e))] 

0<x<K x>K 



sup [Fi(x(l + 2£)) - Fi{x{l - 2e))\ + o(l). 

Q<x<K 



Therefore, 



hmsupsup[Fi(x(l + 2e))-Fi(x(l-2e))] = lim sup hm sup sup [Fi(x(l + 2e))-Fi(x(l-2e))] = 0, 

eiO x>0 K]oo eiO 0<x<K 

(5.18) 

since Fi is uniformly continuous on {0,K]. This completes the proof of the first statement in 
Lemma 15.11 

We turn to the second statement. The event that k G dTm ^ implies that 

y^^.'v) > log N. (5.19) 

r — 1 

By ()5.6() . we therefore conclude that when e is sufficiently small 

y^f'.'^) > ii:£^-(fc+i) log AT, (5.20) 



which is equivalent to 



Therefore, 



> Ar«{r-i) > \ (5.21) 



lim sup lim sup lim sup P ({A; G 9T^"''^'} n {e < y^''"^' < e"^ n {Z^"''^' < iV«{-i)+'}) (5.22) 

ejO m^oo N~*oo 

< lim sup lim sup lim sup P({ A; G T^'-^'} n {e < y^"'~' < e^^} n {Zj^'''^ G [A^^<^"^ iV^<^+^]}) 

£j,0 m— ►oo N^oo 

= 0, 

which follows from the first statement in Lemma EH] with u = — r^— rr- D 

Proof of Lemma 14.81 By ()4.18|) , it suffices to prove that 

limsuplimsuplimsupP({i/Ar > k^f} f] Em,N n Fm,N n {BNie^k^) = 0}) = 0, (5.23) 

which shows that in considering the event {Hj^ > kj^} n Em,N H Fm,N-, we may assume that 
B^{e,kr,)^0. 

We define the random variable /* by 

/* = sup{A; : B^ie, k) / 0}. (5.24) 

Observe that if Bff{e,k) = 0, then Bisi{£,k + 1) = 0, so that /* is well defined. Indeed, if / G 
Bn{s, k + 1) and / 7^ m, then / — 1 G Bn{£, k). If, on the other hand, Bn{£, k + 1) = {m}, then also 
m G ^iv(e, k). Since {BN{e, k^) = 0} = {k^ > I* + 1}, we therefore have 

{Bj,{e, ki,) = 0} = {/* < k^} = {I* <k^-2} U{/* = k^ - 1}. (5.25) 
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We deal with each of the two events separately. We start with the first. 
Since the sets 8^(8, k) are Zm-measurable, we obtain, as in (|4..S0j) . 



F{{H^ > kr,} n n Fm,M n {l* <k^- 2}) < f{{h^ > r + 2} n n F^,, 



E 



+ Op 



We then use (|4.4U|1 to bound 



E 



^Em.NnF,^,NPrn{l* + 2, A;i) 



< E 



7(1, JV) 7(2, iV) 



2L, 



4 



(5.26) 



(5.27) 



Now, since i3jv(e, /*) / 0, we can pick fci such that /ci — 1 e i3jv(e, /*). Since Bn{£, /* + 1) = 0, we 
have fci - 1 ^ i3jv(e, /* + 1), implying I* + 1 - ki € T^'''^ and I* + 2-ki^ Tin'""^ so that, by (TnH) . 

Similarly, since A;i 8^(6,1* + 1) we have that A;i G Tm''^^ and ki + 1 ^ Tm''^\ so that, again 
by (ESI), > N^. Therefore, since > N, whp, 



(5.28) 



and the exponent of N is strictly positive for r G (2, 3) and e > small enough. This bounds the 
contribution in (|5.27l) due to {/* < k^ - 2}. 

We proceed with the contribution due to {I* = kj^ — 1}. In this case, there exists a ki with 
A;i — 1 G i3jv(e, /cat — 1) so that ki G Tm''^^ and fcjv — fci G Tm''^\ On the other hand, i3iv(e, Aj^r) = 0, 
which together with A;i — 1 G Bi^{e, kj^ — 1) implies that k^ — ki G Tm''^\ and fcjv — /ci + 1 ^ Tm^^- 
Similarly, we obtain that ki G Tm'^^ and /ci + 1 ^ Tm'^K Using Proposition 13.31 we conclude that 

T — 2 

There are two possible cases that we will treat separately: (a) Zj^^'^^k-^ < N^^~^^; and (b) 

^f'-fc > iV^^^ By and the fact that fcjv - fci G dTJa''^\ case (a) has small probability, so 
we need to investigate case (b) only. 
In case (b), we can bound 



> kj,} n E^^^ n n {i* = k^-i}n {z^^f\ > iv^+^}) 



E 



l£;,„,]vnF„,jvn{/*=A:jv-l}l,^{ 



{Z'^;f'X>Nr-i^'}n{kj-leBN{e,kM-l)} 



Pmikjsi, ki) + Ojv,m,e(l)) 



and again use ()4.4Up to obtain 

n{HN > k^} n Em,M n n {/* = A:^ - 1} n {zll'2k, > n^^+'}) 



(5.29) 



(5.30) 



< E 



-'--E^,iVnF™,jv(£)n{/*=fejv-l}l 1-^(2, AT) 



'}n{fci-ieBjv(E,fcjv-i)} 



^(l,JV)^(2,JV) 

exp{-^i:i^^^}]+o^,^,,(l). 



We note that by Proposition 13.31 and similarly to 1)5. 28() . 



fei+1 fejv— fei — 



(5.31) 



and again the exponent is strictly larger than 1, so that, following the arguments in H5.26M?3n|) . 
we obtain that also the contribution due to case (b) is small. □ 
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Proof of Lemma 14. Ill Recall that x = k'^Y^'^^ and y = ^Y^'^\ and that x > y. The event 
£rn,N in l|4.7fl|) is equal to 

^— ^ log iV < < log iV, and x + y < (1 + e) log iV. (5.32) 

T — 1 r — 1 

Therefore, by H4.65() . 

y > - > (l-e)^logiV. (5.33) 
K r — 1 

Also, by the bound on x + y in 1)5.32(1 and the lower bound on x in l|5.32() . 

y < (1 + e) log iV - X < (1 + e) log iV - ^—^ log iV = (l + e^-) log A^. (5.34) 

T — 1 T — 2 T — 1 

Therefore, by multiplying the bounds on x and y, we obtain 



and thus 



\2 ^ t^{1,JV)ta(2,JV) - /- 



p(i?^,^ n £m,N n > A;,.}) < p (1 - e)^ < —^Xr::+X::+' < i + — ^ (i + e) 

c log N T — z 



(5.36) 



where we abbreviate c = / . We conclude that 



limsuplimsuplimsupP(i?m,iv H £m,N H {-ffjv > /c^v}) = 0, (5.37) 

e|0 m^oo N^oo 

analogously to the final part of the proof of Lemma l5. 11 □ 

Proof of Lemma 14.121 We recall that Mi = min(gz(«^*^'^^ + '^^'~*^'^^)- We repeat the arguments 
leading to (|4.88Ml^nT|) to see that, as first N ^ oo and then m — > oo, 



p(F^ n n Em,N) < P (-e < Ml - K-'^^-rv^i < ^(1)^(2, > ^ ^^^^^^^ (5_3g) 

= (^_e < M« - K-"~-rV2l < e|y(i)y(2) > o) + Ojv,m(l). 

Recall from Section |21 that, conditionally on y'^^yf^) y ^.j^g random variable Mi has a density. 
Recall that we denoted the distribution function of M/ given y'^'y^^) > by Furthermore, 
i^-aN-Hm £ = [^-r«/2l^^-rV2l+i]^ ^hat, uniformly in N, 

P f-e < M/ - K-'^^-rVsl < ^ y{i)y{2) > o) < sup[F2(u + e) - F2(u - e)] = 0, 



where the conclusion follows by repeating the argument leading to (|5.18l) . This completes the proof 
of Lemma 14. 121 □ 

A Appendix: Proof of Propositions 13. 11 - 13.31 

The appendix is organised as follows. In Section lA.ll we prove three lemmas that are used in 
Section (a. 21 to prove Proposition 13.11 In Section (A. 31 we continue with preparations for the proofs 
of ProDosition l3.2l and l3.31 In this section we formulate key Proposition IA.3.21 which will be proved 
in Section IA. 41 In Section lA. 51 we end the appendix with the proofs of Proposition 13.21 and 13.31 
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A.l Some preparatory lemmas 

In order to prove Proposition 13.11 we make essential use of three lemmas, that also play a key role 
in Section lA.41 below. The first of these three lemmas investigates the tail behaviour of 1 — G{x) 
under Assumption 11.11 Recall that G is the distribution function of the probability mass function 
{gj}, defined in (|rH|) . 

Lemma A. 1.1 If F satisfies Assumvtion M . 1\ then there exists Kj- > such that for x large enough 

where 7(x) = (logx)'^""^, 7 G [0,1). 

Proof. Using definition ()1.8() we rewrite 1 — G{x) as 



(x + 2)[l-F(x + l)]+ \\-F{j)\ 

j=x+2 



Then we use |201 Theorem 1, p. 281], together with the fact that 1 — ^(2;) is regularly varying with 
exponent 1 — r 7^ 1 to deduce that there exists a constant c = c,- > such that 

00 

[l-F{j)\<Cr{x + 2)[l-F{x + 2)]. 

j=x+2 

Hence, if F satisfies Assumption 11.11 then 

1 - G{x) > ^(x + 2) [1 - F{x + 1)] > 

1 - G(x) < i(c + l)(x + 2) [1 - F(x + 1)1 < 

for some K-r > and large enough x. □ 

Remark A. 1.2 It follows from Assumvtion ll . 1\ and Lemma \A.l.l\ that for each e > and suffi- 
ciently large x, 

(A.1.2) 

We will often use \A.l.^) with e replaced by . □ 



Let us define for e > 

n = 

T-1 



1 _ p5 

a = , h = e^, (A. 1.3) 



and the auxiliary event by 

Fs = {yi <x < N'^ : \G{x) - G^''\x)\ < N'^il - G{x)]}, (A.1.4) 
where is the (random) distribution function of {gj^'}, defined in (|3.2|) . 
Lemma A. 1.3 For e small enough, and N sufficiently large, 

P(i^e) < N-^. (A.1.5) 
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Proof. First, we rewrite 1 — G'^'(x), for x £ Nq, in the following way: 

CXD Af oo 1 ^ 



n=x+l 

N Dj 



j=l n=x+l 



-j^ ^. J ^ cjo Af 

— X] X] ^{Dj>x+2} = 1— X X ^ 



L{Dj>x+2} — y- 

where x V / is the maximum of x and I. Writing 



{_Dj>(x+2)VZ}) 



1=1 j=l 



N 



we thus end up with 



j(iV) 

(a;+2)vr 



l-&'^\x) = ^Y.B[ 
1=1 

We have a similar expression for 1 — G{x) that reads 

^ oo 

1 - = - ^¥{Di > (x + 2) V /). 

Therefore, with 



1^ 



1=1 



l-h 

7^' 



and 



X 



l + 2h 
7^' 



we can write 



[G(x)-G(-)(x)] =(^-i)[i-G(x)] 



n{iV) 

■°{a;+2)Vi 



Arp(Di > (x + 2) V /) 



+^ Er^.+i [i?i:i2)v. - ^HDi > (X + 2) V /)] 



+1^7 Ez=ivx+i [Bi:U),, - NF{D, > (X + 2) V /)] . 
Hence, for large enough and x < A^" < < A^^, we can bound 



Rr,ix) = G{x) - G(^)(x) 



< 



Nji 



[1 - G(x)] 



+ L« 2^1=1 



d(JV) 

-°(x+2)V/ 



iVP(Z)i > (x + 2) V /) 



+^Elm^i\Br-NF{D,>l)\ 

I 1 V^OO Tj(N) 

Ln l^l=Nx+l 

+T^Er=m+iNnD,>l). 



(A.1.6) 



(A.1.7) 



(A.l. 



(A.1.9) 



(A.l. 10) 



We use HA.1.2f b')) to conclude that, in order to prove P(-Fjf) < A^ it suffices to show that 



(a) 

(c) 

id) 

(e) 
(A.1.11) 



U {|i?^(x)| >C,Ar-V— ^} j 



< N- 



(A.1.12) 



for large enough A'^, and for some Cg, depending on distribution function G. We will define an 
auxiliary event ^jv,£, such that \Rj^{x)\ is more easy to bound on yljv,e and such that F{A'^j^ ^) is 
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sufficiently small. Indeed, we define, with A = 3(/3 + 2h), 

A-Aa) = {\^ - 1\ < N-^^} , (a) 

^N,eib) = {mayii<j<N Dj < N^} , (b) (A.1.13) 

A^,,{c) = n jl^f' - NF{Di >x)\< ^A(logiV)iVP(Z)i > x)| , (c) 

l<x<N0 ^ ^ 

and 

= A^^,{a) n A^^eib) n Aiv,e(c). 
By intersecting with Ajv,e and its complement, we have 

P( U {\Rn{x)\ > CgN-^x^—^}) 

l<x<N'^ 



< ¥{A^^, nl U {\RNix)\> CgN-f'x^'--''} \ ) + P(A5 

l<x<Afa 



(A.1.14) 



We will prove that P(^^ s) — ^' ^^'^ that on the event Ajv,£, and for each 1 < x < A^", the right 
hand side of HA.1.11|) can be bounded by CgN~^x'^~'^~^ . We start with the latter statement. 

Consider the right hand side of (|A.1.11() . Clearly, on 74jv,e(a), the first term of |i2jv(2;)| is 
bounded by A^'^'^fl - G{x)] < CgN~^'^x^-^+^ < C^A^-'^x^"^"'', where the one but last inequality 
follows from l!^AA.2i h)). and the last since x < < N so that x^'^ < A^^''. Since for / > A^ and 
each j, 1 < j < A, we have Dj > I is the empty set on j4jv,e(6), the one but last term of |i?jv(x)| 
vanish on Ajv,e(6). The last term of \Rn{x)\ can for A large be bounded, using the inequality 

> A and (jXTSfa)), 



1 MXl^-T+ri) 

— V APpi > /) < V < < Cgiv-^+^^^-r+h) < CgN-^x^-^-^, 

i=m+i i=m+i 

for all X < A", and where we also used that for e sufficiently small and r > 2, 

X{2 - T + h) < -h + a{2 - T + h). 
We bound the third term of \Rn{x)\ as 

— - NF{Di > 0| < — Y] + NF{Di > I)] 

< m[N-^B'-^^ + F{Di > A^)]. (A.1.15) 

We note that due to ()A.1.2r al). 

F{Di > N^) > Ar/3(i-^-'^), (A.1.16) 

for large enough A, so that 



= jA{logN)NF{Di > N(^) < NF{Di > A^). (A.1.17) 
Therefore, on >ljv,e(c), we obtain that 

B^j < 2AP(i:>i > A^), (A. 1.18) 

for e small enough and large enough A. Furthermore as e | 0, 
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for X < N", 2 — r — /i < 0, because (after multiplying by r — 1 and dividing by e^) 

X + P{l-T + h) <-h + a{2-T-h), or e{2 + 2t - h) < t - 2 + h, 

as e is sufficiently small. Thus, the third term of |i2iv(x)| satisfies the required bound. 
We bound the second term of \Rn{x)\ on Apf^ir{a) R ^jv^£(c) by 

1 ^ ^AilogN)NF{D^ > (x + 2) V /) = ^^^^^ ^ ^P(Z)i > (x + 2) V /). (A.1.19) 

1=1 ^ 1=1 

Let c be a constant such that (P(£>i > < cx(^~^+'')/2, then for all 1 < x < iV", 

Ar/3 Ar/3 

E l^*S.,v, - A'r(i). > (. + 2) V ,) I < E ((- + 2) V 

1=1 ^ 1=1 

cVAlogiV r^f3_^ , , ^,/3(3-r+W/21 / 2cVAlogiV 



< _ 

< ^/^~l/2^/3(3-r+h)/2 ^ ^^^_h^a(2-r-h) < ^^j^^h^2-T-h ^ (A.1.20) 

because 

/i-l/2 + /3(3-T + /i)/2 < -/i + a(2-r-/i), or /i(5r - 4 - /i) < 2e^(r - 2 + /i), 

for e small enough and r G (2,3). We have shown that for 1 < x < N°^, N sufficiently large, and 
on the event ^iv,e, 

|-Riv(x)| < CgN-^x^-^-^. (A.1.21) 
It remains to prove that F{A% ^) < N^^. We use that 

P(^^^^,) < W{A^^,{ar) + ¥{A^^,{br) + F{A^,,{cr), (A.1.22) 

and we bound each of the three terms separately. 
The bound 



P(^iv,.(a) 



1 ^ 



> • Lj,/N I < -iV"'^, (A.1.23) 

3 



1 ^ 

follows, since N ^-i X]j=i(-C)j — Z^), converges to a stable law, for 2 < r < 3. 

The bound on F{Aj^^i;(by) is a trivial estimate using (|A.1.2r a)). Indeed, for large, 

F{A^^s{by) = F(^max^Dj > < NF{Di > m) < iVx{i-r+/>)+i < ^N''', (A.1.24) 

for small enough e, because t > 2 + h. For the third term P(^jv,£(c)'^), we will use a bound given 
by Janson |23], which states that for a binomial random variable X with parameters N and p, and 
all t > 0, 

r(|.Y-iVp|>()<2exp{-5|^^^}. (A.1.25) 



We will apply HA.1.25|) with t = ajv(x) = y^yl(log A)AP(Di > x), and obtain that uniformly in 
X < A^", 

P (iSf - A^P(L'i > x)\ > aj,{x)) < 2exp ^ "'^^'^''^ 



2(AP(L>i > x) + aj^{x)/3) 
2(1 + IV^log A/(AP(Z)i > m))) 



< 2 exp I , ^ I < 2A-^/^ (A.1.26) 
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because 



log 



< 



logiV 



NF{Di > iV") - Ni+a{T-i-h) 
as N ^oo. Thus, (IA.1.261) gives us, using A = 3(/5 + 2h), 



0, 



,e{cT) < - NF{Di > x)\ > a^(x)) < 2N'^~^/^ = 2N-^^ < -N'^. 



x=l 



This completes the proof of the lemma. 

For the third lemma we introduce some further notation. For any x G N, define 



(N) 
i ' 



Vi"' = max Xf\ 

l<i<x 



i=l 



where {Xf^^}^^^ have the same law, say H^^\ but are not necessarily independent. 
Lemma A. 1.4 (Sums with law H'^^^ on the good event) 

(i) //F(^) satisfies 

[1 - i?<~H^)] < [1 + 2N-^\[l - G{z)l yz<y, 
then for all x > 1, there exists a constant h' , such that: 

P(sf)>y) <6'x[l + 2iV-^][l-G(y)]; 

(ii) satisfies 



l-H^''\y)]>[l-2N~'][l-G{y)\, 



and ^ are independent, then for all x > 1, 

P(v;<"'<y) < (l-[l-2iV-^][l-G(2/)]) . 

Proof. We first bound f(s^^'' > y) . We write 



' >y)< P(5r > y,Vr' < y) +F{V^-^ > y). 
The second term is bounded due to HA.1.28|) by 

xF{X['''> >y)=x\l- (y) 1 < x[l + 2N-^] \l - G{y) 



(A.1.27) 

□ 



(A.1.28) 

(A.1.29) 
(A.1.30) 

(A.1.31) 



(A.1.32) 



(A.1.33) 



We use the Markov inequality and (|A.1.28() to bound the first term on the right-hand side of 
(lA. 1.321) by 



■^X — in ' X 

y 

X 



y \ - \vx ^yj-/ y 

y 

X 



< ^ - iJ(-'(z)] < ^[1 + 2N-^] ^[1 - G{€)\. 



y t 



(A.1.34) 



i=l 
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For the latter sum, we use j2Ul Theorem 1(b), p. 281], together with the fact that 1 — G{y) is 
regularly varying with exponent 2 — r 7^ 1, to deduce that there exists a constant ci such that 

J2[^-G{i)]<cMl-G{y)]. (A.1.35) 

i=l 

Combining (lA. 1.321) . (lA. 1.331) . (lA. 1.341) and (lA. 1.351) . we conclude that 

'S^f^>y)<b'x[l + 2N-%1-G{y))], (A.1.36) 



where 6' = ci + 1. This completes the proof of Lemma lA.1.4r i'). 

For the proof of (ii), we use independence of {X^'^^}f^i, and condition (|A.1.3fl|) . to conclude 
that 



P(v;w<yj = (^<"'(y)j = (1- [i-i/<"'(y)Jj < (l-[l-2iV-''][l-G(y)]^ 

Hence, (IA.1.31|1 holds. □ 

Remark A. 1.5 In the proofs in the appendix, we will only use that 

(i) the event Ff, holds whp; 

(ii) that Lff is concentrated around its mean; 

(iii) that the maximal degree is bounded by for any x > ^/{t ~ 1); with whp. 

Moreover, the proof of Provosition \3.1\ relies on '^21, Proposition A.3.1], and in its proof it was 
further used that 



(iv) ^ A^"^, whp, for any a2 > 0, where pjv is the total variation distance between g and g^^\ 
i.e., 

Pn 



lY.\9n-9n- (A.1.37) 



Therefore, if instead of taking the degrees i.i.d. with distribution F, we would take the degrees in 
an exchangeable way such that the above restrictions hold, then the proof carries on verbatim. In 
particular, this implies that our results also hold for the usual configuration model, where the degrees 
are fixed, as long as the above restrictions are satisfied. □ 

A. 2 Proof of Proposition IXn 

The proof makes use of j2H Proposition A.3.1], which proves the statement in Proposition l3.1l under 
an additional condition. 

In order to state this condition, let, for i = 1,2, {Zj*'^'}j>i be two independent copies of the 
delayed BP, where Z'^'^^ has law {fn} given in Hl.l|) . and where the offspring of any individual 
in generation j with j > 1 has law {gii^^}, where ^n^' is defined in (|3.1j) . Then, the conclusion 
of Proposition 13.11 follows from [211 Proposition A.3.1], for any m such that, for any ry > 0, and 
i = l,2, 

m 

By exchangeability it suffices to prove ()A.2.1|) for i = 1 only, we can therefore simplify notation 
and write further Z^^' instead of Z''f!'^\ We turn to the proof of HA.2.1|) . 
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By Lemma Fa. 1.31 and ()A.1.2r b)). respectively, for every rj > 0, there exists a > 0, such that 
whp for ah x < N", 

1 - G(^'(x) < [1 + 2iV-'^][l - G(x)] < CrjX^-^+'i. 
We call a generation j > 1 good, when 



1 — 2 — ?7 



and bad otherwise, where as always Zq'^^ = 1. We further write 

Hm = {generations 1, . . . , m are good}. 

We will prove that when Hm holds, then X^jLi ^j^' — Indeed, when generations 1, 
all good, then, for all j < m, 

if <(iogiv)S^=i(^-2-'')-\ 

Therefore, 

^Zj^' < m(logiV)S»'^i(^-2-^) ' < m(log7V) (r-2-^)-i-i < jy'', 
i=i 

for any > 0, when is sufficiently large. We conclude that 

m 

P(5^Zf^>Ar^)<P(/7^), 
i=i 

and Proposition 13. II follows if we show that ¥{H^) = o(l). In order to do so, we write 

m— 1 

For the first term, we use (|A.1.2r a)') to deduce that 

P(Ff) =P(I?i > {logN)^=^) < (logAf)"^^ < (logA^)~\ 
For 1 < J < m, we have Zj^' < YlT=i ^'C^ ^ ^'^'^ using ()A.2.6|1 . 

(t-2-7,)- 



(A.2.2) 
(A.2.3) 



^ij"^' < m(logAr) (-2-'))-^-i = K 



(A.2.4) 
. . , m are 

(A.2.5) 
(A.2.6) 

(A.2.7) 

(A.2.8) 

(A.2.9) 
(A.2.10) 



Using Lemma lXXIF i'l with H'-^'> = G'-^\ x = I and y = Vm 
from (IA.2.21) . we obtain that 



log A^) --2-" , where (|A.1.28j) follows 



m^+i n F,) < 5;p(zj-i > ^,|zf' = /)p(Zf' = /) < ^ max P(sr > 
«=i — ^ 



< max P(5<^' > Vm) + h' max l[l + 2Af-''] [l - G{vt,)\ 



l<l<Xo 



(A.2.11) 



The case where / < Xq can, by HA.2.2|) . be bounded as 



t ( ( 

XoP(5]^f' > v^) < x„p( U{Xf' > v^/l}) < J]p(Af' > v^/l) 



< c,/xoK/0'-^+'' = |2^(logAf)-^ < C(logA^)-i. 

Furthermore by (|A.1.2r b)). 

max /[I - G{v^)] < max < (log A^)-^ 

This completes the proof of Proposition 13.11 



(A.2.12) 

(A.2.13) 

□ 



36 



A. 3 Some further preparations 



Before we can prove Propositions 13.21 and 13.31 , we state a lemma that was proved in . 

We introduce some notation. Suppose we have L objects divided into groups of sizes 



di, . . . , djv, so that L = Yl^=i ^i- Suppose we draw an object at random. This gives a distribution 



5^, i.e., 



N 



1=1 

Clearly, g^'^^ = g^'^\ where D = {Di, . . . ,D]^). We further write 



(A.3.1) 



(A.3.2) 



n=0 



We next label M of the L objects, and suppose that the distribution Gm\x) is obtained in a 
similar way from drawing conditionally on drawing an unlabelled object. More precisely, we remove 
the labelled objects from all objects thus creating new d[,...,d'^, and we let Gif{x) = G^^^x). 
Even though this is not indicated, the law G^? depends on what objects have been labelled. 

Lemma IA.3.11 below shows that the law G^^f can be stochastically bounded above and below 
by two specific ways of labeling objects. Before we can state the lemma, we need to describe those 
specific labellings. 

For a vector d, we denote by < d^2) < • • • < d(jv) the ordered coordinates. Then the laws 
and G|,? , respectively, are defined by successively decreasing d(jv) and d^i^ , respectively, by one. 
Thus, 



G['\x) 



Gf{x) 



1 



N-l 



L 



L 



d 



(JV) 



i=l 
N 



i=2 



L 



'-{rf(iV)-l<a;+l}' 



L - 1 -^Hi)-l<^'+l}- 



(A.3.3) 
(A.3.4) 



For Gj^j and G^i , respectively, we perform the above change M times, and after each repetition 
we reorder the groups. Here we note that when d(jv) = 1 (in which case di = 1, for all i), and for 

G^-f^ we decrease (i(jv) by one, that we only keep the groups with di = 1. The same rule applies 
when = 1 and for G[''^ we decrease ci(i) by one. Thus, in these cases, the number of groups of 
objects, indicated by A^, is decreased by 1. Applying the above procedure to d = {Di, . . . , Dpj) we 
obtain that, for all x > 1, 



Gi:\x) ^Gi:\x) < 



1 



N 



Gir{x)^ar\x)> 



(D), 



Lr, -M 
1 



+1} 



i=l 
AT 



M 



[ Al{D,<x+l} 



L^-M 
- M 



:G 



i=l 



M 



Lj,G^''\x)-M 



(A.3.5) 
(A.3.6) 



where equality is achieved precisely when -D(iv) > x + M, and ^{i ■ Di = 1} > M, respectively. 

Finally, for two distribution functions F, G, we write that F < G when F{x) > G{x) for all x. 
Similarly, we write that X when for the distribution functions Fx, Fy we have that Fx ^ Fy- 

We next prove stochastic bounds on the distribution G[f{x) that are uniform in the choice of 
the M labelled objects. The proof of Lemma lA. 3. II can be found in |21j . 
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Lemma A. 3.1 For all choices of M labelled objects 



Gj!> ^ Glf ^ Glf . (A.3.7) 



Moreover, when Xi, . . . ,Xj are draws from Gm\ , • • • , Gm\ , where the only dependence between 
the Xi resides in the labelled objects, then 

j j j 



1=1 i=l 1=1 



or 



where {2Li}i=i (I'^d {XiY-^^, respectively, are i.i.d. copies of X_ and X with laws G^f^ and Gj^j f 
M = maxi<j<; Mi, respectively. 

We will apply Lemma [011 to G'^' = G*'^'. 
A. 3.1 The inductive step 

Our key result, which will yield the proofs of Proposition 13.21 and 13.31 is Proposition IA.3.'2l below. 

This proposition will be proved in Section (A. 41 For its formulation we need some more notation. 
As before we simplify notation and write further on instead of Z''f^'^\ Similarly, we write 
instead of Z^'' and T^\e) instead of Tm^^e)- Recall that we have defined previously 

> 1 and a 



T-2 r-1 

In the sequel we work with Y^'^^ > e, for k large enough, i.e., we work with > e^'^'' > 1, due 
to definition 1)3. 3(1 . Hence, we can treat these definitions as 

y,'^^ = K-'=log(Zf' ) and n = /^-Mog(Zfe). (A.3.9) 

With 7 defined in the Assumption !!.!! and < e < 3 — r, we take rUe sufficiently large to have 

oo oo 

E - 2 + e)^(i-T) < £3 and E '^"^ ^ (A.3.10) 

k=Tnir k=ms 

For any < m < k, we denote 

k k 

M'^'> = Y,ZT\ Mk = Y.^r (A.3.!l) 

As defined in Section 3 of we speak of free stubs at level /, as the free stubs connected to 
nodes at distance / — ! from the root; the total number of free stubs, obtained immediately after 
pairing of all stubs of level / — ! equals Z;*^' . For any / > ! and \ < x < Z^'^l, let Z^^' denote 
the number of constructed free stubs at level / after pairing of the first x stubs of Z;^|- Note that 
for X = Z^^l, we obtain Z^^' = Z/^^ For general x, the quantity Z^.^' is loosely speaking the sum 
of the number of children of the first x stubs at level I — 1, and according to the coupling at fixed 
times f Proposition !3. l)) this number is for fixed /, whp equal to the number of children of the first 
x individuals in generation / — I of the BP {Zk}k>i- 

We introduce the event Fm,k{^), 

{ke%i!'\e)} (a) 

i'mm- n{e<yr'<e-i} (c) ^^-^-^^^ 

n{Mr^ < 2Zr}. {d) 
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In the proof of Proposition IA.3.'2l we compare the quantity Z^'J' to the sum Yli=i -^il-i ^'^^ part 
(a) and to maxi<i<x ^l^Li for part (b). We then couple ^^-^l]^ to xl'J!_i for part (a) and to X..^^ ^ 
for part (b). Among other things, the event -^^^^(s) ensures that these couphngs hold. 

Proposition A. 3. 2 (Inductive step) Let F satisfy Assum^ption M . 1[ For e > sufficiently small 
and sufficiently large, there exist a constant b = b{T, e) > such that, for x = Z^_!^ A N "(^-i) , 

IP(i^m,z(e) n {Z^y > < (a) 

F^F^^iie) n {Z^y < ( -)''-^-^(-) }) < 6/-3. (6) 

The proof of Proposition IA.3.21 is quite technical and is given in Section IA.4I In this section we 
give a short overview of the proof. For / > 1 , let SPGi denote the shortest path graph containing 
all nodes on distance I — 1, and including all stubs at level I, i.e., the moment we have Zj^^^ free 
stubs at level /. For i G {1, . . . , x}, let A'j'^li denote the number of brother stubs of a stub attached 
to i^^ stub of SPGi-i (see Figure El). 



a) b) 




Figure 4: The building of the l^^ level of SPG. The last paired stubs are marked by thick lines, the brother stubs 
by dashed lines. In a) the {I — 1)°* level is completed, in 6) the pairing with a new node is described, in c) the pairing 
within the {I — 1)"* level is described, and in d) the pairing with already existing node at l^^ level is described. 
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Because ^ is the number of free stubs at level / after the pairing of the first x stubs, one 
would expect that 

^17~E4^P (A.3.13) 

i=l 

where ~ denotes that we have an uncontrolled error term. Indeed, the intuition behind ()A..S.13|) 
is that loops or cykels should be rare for small /. Furthermore, when M^'^] is much smaller than 
A'^, then the law of A^'^li should be quite close to the law G*'^', which, in turn, by Lemma lA. 1.31 is 
close to G. If A^j'^li would have distribution G{x), then we could use the theory of sums of random 
variables with infinite expectation, as well as extreme value theory, to obtain the inequalities of 
Proposition IA.3.21 

In order to make the above estimates rigorous, we use upper and lower bounds. We note that 
the right-hand side of (|A.3.13|) is a valid upper bound for Z^J^^ . We show below that X^'^^^ have 
the same law, and we wish to apply Lemma lA.1.4r i'). For this, we need to control the law X^'^^^, 
for which we use Lemma lA.3.11 to bound each X^'^^-^ from above by a random variable with law 

This coupling makes sense only on the good event where is sufficiently close to G. 
For the lower bound, we have to do more work. The basic idea from the theory of sums of 
random variables with infinite mean is that the sum has the same order as the maximal summand. 
Therefore, we bound from below 

> Ziy - X. (A.3.14) 

where 

However, this lower bound is only valid when the chosen stub is not part of the shortest path 
graph up to that point. We show in Lemma lA.3.41 below that the chosen stub has label 1 when 
> 2M/^]. In this case, ()A.3.14|) follows since the x — 1 remaining stubs can 'eat up' at most 
X — 1 < X stubs. To proceed with the lower bound, we bound (A|^^-^, . . . , A^^^-^) stochastically 
from below, using Lemma lA.3.H by an i.i.d. sequence of random variables with laws Gi^\ where M 
is chosen appropriately and serves as an upper bound on the number of stubs with label 3. Again 
on the good event, G^^P is sufficiently close to G. Therefore, we are now faced with the problem 
of studying the maximum of a number of random variables with a law close to G. Here we can 
use Lemma lA. 1 .4r ii) . and we conclude in the proof of Proposition IA.3.'2l^ a) that Z_^^i is to leading 

I-e/2 

order equal to x'^, when x = Z^'^j A A^'^^^) . For this choice of x, we also see that Z_^^i is of bigger 
order than M^^^^ so that the basic assumption in the above heuristic is satisfied. This completes 
the overview of the proof. 

We now state and prove the Lemmas IA.3.31 IA.3.41 The proof of Proposition IA.3.21 then follows 
in Section fA.4[ We define the good event mentioned above by 

x=l 

(A.3.16) 

The following lemma says that for M < N°' the probability of the good event is close to one. 
Lemma A. 3. 3 Let F satisfy Assumvtion M . 11 Then, for e > sufficiently small, 

^{Fl^c) < A-'', for large A. 
Proof. Due to Lemma lA.1.3l it suffices to show that for e small enough, and A sufficiently we have 

Kn'^ C F^. (A.3.17) 
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We will prove the equivalent statement that 



FeCFe^NC. (A.3.18) 
It follows from (|A ..3.511 and (lA. 3.611 that for every M and x 

1 - Gif\x) < 1 - < 1 - (A.3.19) 

and, in particular, that for M < N", 

[l-Gl^\x)] - [l-Gif\x)] < < CN--\ (A.3.20) 

Then we use ()A.1.2r b)'l to obtain that for all x < N°', e small enough, and N sufficiently large, 

(A.3.21) 

Therefore, for M < and with the above choices of e, a and h, we have on F^, 

[1 - Gi:'\x)] < 1 - + [1 - Gr(x)] - [1 - QiPix)] < [1 + 2iV-'^][l - G{x)], 

[1 - Glf'(x)] > 1 - - [1 - Gr(x)] + [1 - Gir(x)] > [1 - 2iV-^][l - Gix)], 

i.e. we have ()A.3.16|1 . so that indeed i^^ C F^^j^a . □ 
For the coupling of ^j^^li with the random variables with laws G^f '(x) and gI^\x) we need 
the following lemma. 

Lemma A. 3. 4 For any I > 1 there are at most 2M^''^' stubs with label 3 in SPGi^i, while the 
number of stubs with label 2 is equal to ^/^j ■ 

Proof. The proof is by induction on There are Zj^' free stubs in SPGi. Some of these stubs 
will be paired with stubs with label 2 or 3, others will be paired to stubs with label 1 (see Figure [l]). 
This gives us at most 2Z|^' stubs with label 3 in SPG2- This initializes the induction. We next 
advance the induction. Suppose that for some / > 1 there are at most 2M^'^^ stubs with label 3 in 
SPGi+i. There are -^/^j free stubs (with label 2) in SPGl-^-l. Some of these stubs will be paired 
with stubs with label 2 or 3, others will be linked with stubs with label 1 (again see Figure |1J). This 
gives us at most ^Zj^^l new stubs with label 3 in SPGi-^-2- Hence the total number of these stubs 
is at most 2Mj*^' + '^Z^^^ = 2Mj*^j. This advances the induction hypothesis, and proves the claim. 
□ 

A. 4 The proof of Proposition IA.3.21 

We state and prove some consequences of the event Fm^ki^), defined in ()A.3.12|) . We refer to the 
remark, following definition ()A.3.12|1 . to explain where we use these consequences. 

Lemma A. 4.1 The event -Fm.,fc(£) implies, for sufficiently large N, the following bounds: 

1-3e"^/4 

(a) Ml!:2i<N^(^, 

(6) for any 6 > 0, N'^ < k-\ ^^^^^^^ 

(c) K^-^{e - e^) < log {Z]^2i) < K^'^ie-^ +e^), for k-l> m, 

(d) Ml'^i < 2Zl!2i for k-l>m. 
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Proof. Assume that ()A.3.12r aVfd)) holds. We start by showing HA.4.ir b')^. which is evident if 
we show the following claim: 

logf^fe^rlogiV 



k < ^^^^ ^, (A.4.2) 

logK 

for large enough. In order to prove HA.4.2() . we note that if A: G Ti^\s) then, due to defini- 
tion (jSISl), 

< , ^jZn < -^T^^-'^logN, (A.4.3) 
r - 1 log {Zin') - 1) 

where the latter inequality follows from Y^^' > e and HA.3.9|) . Multiplying by and taking 
logarithms on both sides yields HA.4.2|1 . 
We now turn to HAA.li a)). bmce 



M^, =yzl''^<k max Zj 

l=\ 

the inequality ()A.4.ir a)) follows when we show that for any I < k — 1, 



k-l 

'I ' 

1=1 



Zf'<N^(^. (A.4.4) 

Observe that for I < m we have, due to (|A.3.12r c)) and ()A.3.12r d')). for any e > and m fixed and 
by taking A^ sufficiently large, 

Z/^^ < M^^) < 2Z^^ < le""^'^ < N^^. (A.4.5) 
Consider m < I < k — 1. Due to ()A.3.9|) . inequality (|A.4.4|1 is equivalent to 



^«+ly^(iV) < log (A 4 g) 



To obtain ()A.4.6|1 we will need two inequalities. Firstly, ()A.3.12r a)) and I + 1 < k imply that 



l-e2 



r — i 

Secondly, (IA.4.71) and HA.^.Ui c)) imply that 

< ^ log A^. (A.4.8) 

e(r - 1) 

Given HA.4.7|) and (|A.3.12r b)'). we obtain, when > e, and for m < ^ < A; - 1, 



<M^log^ = T^logAr. 



(A.4.9) 



Hence we have (|A.4.6p or equivalently ()A.4.4|) for m < I < k — 1. 

The bound in (|A.4.ir c)) is an immediate consequence of ()A.3.9|) and ()A.3.12r b.c)) that imply 
for k — 1 > m, 

We complete the proof by establishing (|A.4.ir d)). We use induction to prove that for all / > m, 
the bound M^*^' < 2Z^^^ holds. The initialization of the induction hypothesis for / = m follows 
from ()A.3.12r d)). So assume that for some m < / < A; - 1 the inequality M/"^^ < 2^/"^' holds, then 

M/^] = Z'^l + M/^^ < + 2Z/^^ , (A.4.10) 
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so that it suffices to bound 2Zj*'^-' by Z^'^j . We note that -Fm,A;(£) imphes that 

|yW _ yW| < |yW _ yW| + |yW _ yW| < < 2,e^Y,^^l. (A.4.11) 

Therefore, 

2Z/"' = 2e'^'^/"' < 2e(i+='^')'^'<"' = 2(z;;|) < Z^, (A.4.12) 

when e > is so small that uj = (1 + 3e^)K^^ < 1 and where we take m large enough to ensure 
that for / > m, the lower bound -^/^j = expl^^'^^l^'^j^*} > exp{K'"'"^e} > 2 1-^ is satified. □ 

Proof of Proposition IXr3.2lf a^. Recall that a = ^^Erf • We write 

(A.4.13) 

where Pm is the conditional probability given that F^^j holds, and where we have used Lemma fA.3.31 
with A^~'^ < It remains to bound the first term on the right-hand side of (|A.4.13|1 . For this 
bound we aim to use Lemma lA. 1.41 Clearly we have 

^S*< 1:4^1, (A.4.14) 

because loops and cycles can occur (in Figure E] only the case b) contributes to Z^^\ the cases c) 
and d) do not contribute). Since the free stubs of SPGi^i are exchangeable, each free stub will 
choose any stub with label unequal to 3 with the same probability. Therefore, all 4T-i have the 
same law which we denote by H^'^K Then we observe that due to HA.3.8|) . 1 '-'^^ coupled 
with x\'^i^_i having law where M is equal to the number of stubs with label 3 at the moment 
we generate ^1^1^, which is at most the number of stubs with label 3 in SPGi plus 1. The last 
number is due to Lemma lA.3.4l at most 2M^'^\ + 1. By Lemma lA.4.ir a). we have that 

2M/^] + 1 < 2iV^^^(^ + 1 < = iV", (A.4.15) 

and hence, due to (|A.3.8|) . we can take as the largest possible number M = N'^. We now verify 

I-e/2 

whether we can apply Lemma lA.1.4r i). Observe that x < N^^-^^i so that for N large and each c^, 
we have 

y = (/33.)'^+c-,7(^) < (A.4.16) 

since by HA.4.2|) . we can bound I by a double logarithm. Hence HA.1.28|) holds, because we condition 
on Fe . We therefore can apply Lemma IXOr n. with 5^' = ELi^i!I-i> H^"^ = <^!v«> and 
obtain also using the upper bound in ()A.1.1|1 . 

IPiv- [f^A^) n {Z^,i > {fixy^'-^^^''^ }) < b'x[l + 2Ar-^] [1 - G{y)] 

< 2b'xy-''~'+^-^^y^ = 26'x(/\)(-'^"'+^-^(^))(''+"^^(^)) < br^, (A.4.17) 

if we show that 

c^7(x) + Kr7{y)) + KKrjiy) < 0. (A.4.18) 

Inequality HA.4.18|) holds, because 7(y) = (log y)'''^-'^, 7 G [0,1), can be made arbitrarily small by 
taking y large, which follows from ()A.4.ir c)): /'^x > exp{K™'e/2}, and because m can be taken 
large. □ 
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Proof of Proposition IA.3.2r b^. Similarly to (|A.4.13|1 . we have 

p(F^,Ke) n {z^-' < ( }) < P,. n < (-)'^--^(-) }) + r3, (a.4.i9) 

and it remains to bound the first term on the right-hand side of HA.4.19() . Recall that 



1 — A . , 



where, for 1 < i < x, X'^'^^-^ is the number of brother stubs of a stub attached to the i^^ free stub 
of SPGi-i. Suppose we can bound the first term on the right-hand side of ()A.4.19|1 by bl~^, when 
Z^J^i is replaced by Zj^j' after adding an extra factor 2, e.g., suppose that 

Then we bound 

P.<^ (PmAe) n {^S' < }) (A.4.20) 



<p..(F„,,(£)n{z<-'<2(f)^-^-^(^) 
+ P,. (F^,i{e) n {z^' < (f )'^-^-^(^') } n {^3' > 2 ( - )'^-^-^(-) 



By assumption, the first term is bounded by bl~^, and we must bound the second term. We will 
prove that the second term in HA.4.20() is equal to 0. 

For X sufficiently large we obtain from / < Clogx, k> 1, and 7(3;) — > 0, 

X \ K-Cjl{x) 

> 6x. (A.4.21) 



IK 

Hence for x = Z/^} > {e - e^)K^-\ it follows from Lemma EXHd), that > 2 
induces 

Z^' > 6Z/^} > 2M/^] + 2Z/^j. (A.4.22) 

(l-e/2) 

On the other hand, when x = N < Zj^'^^, then, by Lemma lA.4.ir a'). and where we use again 

I < Clogx, K > 1, and 7(x) — > 0, 



/ l-e/2 \ i^-c-ili^) 



V 



/3 



> 2N 1) 2iV^<^ > 2M;_Y + 2x. (A.4.23) 

We conclude that in both cases we have that Z^' > 2M/^] + 2x> 2M/^^ 2x. We claim that the 
event Z^^^ > 2M/^^ + 2x implies that 

> Z}:j - X. (A.4.24) 
Indeed, let io S {1, . . . , N} be the node such that 

and suppose that ig G SPGi^i. Then is at most the total number of stubs with labels 2 
and 3, i.e., at most 2M/^^ + 2x. Hence Z^' < Ao < 2M/5^ 2x, and this is a contradiction 
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with the assumption that Z},^i > 2M/_j^2 + 2x. Since by definition iq G SPGi, we conclude that 
io S SPGi \ SPGi-i, which is equivalent to saying that the chosen stub with Z}^^ brother stubs 
had label 1. Then, on 

> 2^/^2 + 2x, we have (IA.4.24jl . Indeed, the one stub from level / — 1 
connected to iq gives us Zj^^^ free stubs at level / and the other x — 1 stubs from level I — 1 can 'eat 
up' at most X stubs. 

We conclude from the above that 

n < (l)'^-^^(-) } n {Z}:j > 2 }) (A.4.25) 

< P.-^ {Prr^Ae) n < ( - } n {z^-' > 2 ( - )''-^-^(-) - x}) = 0, 

since ()A.4.21|) implies that 

Below we prove in two steps that there exists b such that 

P^«(^m,z(£)n{Z^^;/<2(f)''-^-^(")}) <bl-^. (A.4.26) 

First we couple {-^i7-i}f=i with a sequence of i.i.d. random variables {^^l;^}f=i with law G^', 
such that 

4rii>^Mli> ^ = l,2,...,x, (A.4.27) 

and hence 

ZLV > l^r' max X'^^ , . (A.4.28) 

Then we apply Lemma IJXI^ ii) with l^^'^' = and y = 2(x/Z3)'^-c77{^). 

We prove the fact that we can couple {X|^2i}f=i with a sequence of i.i.d. random variables 
{2[^uLi}i=i with law G^^a by induction on x. For x = 1, the claim follows from Lemma |A.3.1I 
with 1 = 1. Observe that for the x^^ stub at level / — 1, we sample a uniform stub from the stubs 
with labels 1 and 2. Hence, due to HA.3.8|) . and conditionally on , we can bound ^^^Li 

from below by X}^i_i, which has law where M is equal to the number of stubs with label 

3 at the moment we generate X^'^^^_-^. This number is bounded from above by 2M^'^2 + 2x < N"' 

by (|A.4.ir a)) and the fact that x < "(^-i) < ^A". Indeed, the maximal possible value for M 
corresponds to the moment we sample X^^^^, i.e., M is at most the number of stubs with label 3 
in SPGi^i plus 2x for the pairing at most x free stubs of SPGi^i. Hence, due to Lemma FA. 3. 41 M 
is at most 2M/^^ + 2x < A", and due to (lA .3.811 we can take M = A". Therefore, conditionally on 

{^i'l-i}i=i ^ we can bound ^^^Li from below by 2L'x^i-i, which has law and this conditional 

coupling is equivalent to fact that each component of {X^^^^^jf^-^ can be bounded from below by 
the components of iA|^^ i^f-i^ where |A|^' i^f-i i.i.d. copies with law G^'. 

We finally restrict to x = A N^(^ . Note that y = 2(x//3)'"-'=^'^(^) < N°, so that F^^nc. 
holds, which in turn implies condition ()A.1.30|) . We can therefore apply Lemma IA.1.4r ii) with 
= i = 1, 2, . . . , X, = G^^J, and y = 2(x/Z3)''-^77(^) to obtain from (IA.4.28|) . 

n"(F„.,Ke)n{z<-'<2(f)'^-^^^(^) }) 

< P ( max X^^jl-. < y ) < f 1 - [1 - 2A-''] [1 - G (y)] V . (A.4.29) 
yi<i<x ' y \ / 

From the lower bound of HA.1.1|) . 

[1 - G{y)] > y-'^ ~Krj{y) ^ 2''-'''<^^'''\x/l^)^-'^ -■f^-7(y)){«-c^7W) > (A.4.30) 
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because x/l^ > 1 and 

K~^c-yj{x) - KKr^{y) + c^Kr^{x)-f{y) > c^k"V(x) - KKr-f{y) > 0, 

by choosing large and using 7(x) > 7(y). Combining HA.4.29|) and (|A.4.30|) and taking 1 — 
we conclude that 

(l - [1 - 2N-''][1 -G{y)]y < (l - ^) < e-''/' < r', (A.4.31) 

because I > m and m can be chosen large. This yields (|A.4.26|) with 6=1. □ 
In the proof of Proposition 13.21 in Section IA.5L we often use a corollary of Proposition IA.3.21 
that we formulate and prove below. 

Corollary A. 4. 2 Let F satisfy Assumvtion M . 11 For any e > sufficiently small, there exists an 
integer m such that such that for any k > m, 

P {f^M^) n {|yf ' - > (t - 2 + < (A.4.32) 

for sufficiently large N. 

Proof. We use that part (a) and part (6) of Proposition 1^.3.21 together imply: 

r[Fm,k{^) n { |iog (zf ') - ^log (zH) I > ^^ogik') + c,7 log {k'zl!!_\) }) < 2bk-\ 

(A.4.33) 

Indeed applying Proposition IA.3.21 with / = k and x = .Z^^\, and hence Z^^k = Z^, yields: 

F(i^m,fe(e) n {Zf ^ > (A;3x)'^+'=^^(^)}) < bk^'^, (A.4.34) 
P(Fm,fe(e) n {Zf < (x/fc3)---^^W}) < (A.4.35) 

and from the identities 

{Zf ^ > {k''xr+'-^^^^} = {log(Zf' ) - Klog(Zn) > log((A:3x)'^+'=-^(")) - Atlogx}, 
{Zf ^ < (x/fc3)---^^(-')} = {log(Zf ') - Klog(Zn) < log((x/A:3)'^+-^^(-')) _ ^log^;}, 

we obtain (IA.4.33|) . 

Applying HA.4.33|) and (|A.3.9|) . we arrive at 

P (f^M^) n {|y,<'"' - y.'^li > (r - 2 + e)'=(i-^)}) (a.4.36) 

< P (F^,fc(e) n {^-'^ [Klog{k^) + c^7 (^n) log (^'^n)] > - 2 + e)'^^^-^)}) + 2bk-\ 

Observe that, due to Lemma lA.4.ir c'). and since 7(x) = (logx)'^^^, where < 7 < 1, we have 
on Fm,fc(e), 

K-'^ [Klog(A:3) + c^j (ZH) log (fc'^n)] 
-fc 



< K 



K 

k 



K\og{k^) + c, (log (Zn))'"' (log(A;3) + log (ZH)) 



Klog(A:3) + iog(A:3) + (log (ZH))" 

< /t-'^ [(c^ + k) log(fc3) + (K'^-i(e-i + £3))^] 

< K-'^(i-T) [^-'^^(c^ + k) log(fc3) + (K-i(e-i + e3))T] < (r - 2 + e)'^(i-T), 
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because, for k large, and since k ^ = t — 2, 

We conclude that the first term on the right-hand side of HA.4.36P is 0, for sufficiently large k, and 
the second term is bounded by 2bk~^ < k~'^, and hence the statement of the corollary follows. □ 

A. 5 Proof of Proposition f3.2l and Proposition 13.31 

Proof of Proposition I3.2lf a). We have to show that 

P(e < 11""' < max |y,'''"^ - > e^) = (1). 

Fix e > 0, such that r — 2 + e < 1. Then, take m = m^, such that HA.3.10|) holds, and increase m, 
if necessary, until (lA .4.321) holds. 

We use the inclusion that (recall the definition of %^\e) given in 1)3. 5() ). 

{ max |yr-^ri>H^}c{ Eln'"'-n'-ll>E(--2 + ^)'^^^-^^}- (A.5.1) 

'^e^™ (^) fceTr'(e) '^^"^ 

If the event on the right-hand side of ()A.5.1|) holds, then there must be a fc E T^\e) such that 
ly^"^ - yl-i\ > {t-2 + e)^(i-T), and therefore 

{ max \Yr-Yi^^\>e'}c [j F^,fc_inF^,„ (A.5.2) 

where we denote 

Fm,k = F.^,k{e)= f] {|y;~)-y;-)|<(r-2 + ey(i-^)}. (A.5.3) 

j=m+l 

Since HA.3.10|1 implies that on i^m,fc-i we have lYj"^^ — Y^^\ < , m < j < k — 1, we find, 
F^,,_i n F^,, C {\Yr - IT'I <e',yi:m<l<k-l}n {ly^ " ^'-H > - 2 + 

(A.5.4) 

Take sufficiently large such that, by Proposition 13.11 

P (M(f' > 2ZW, £ < y w < £-1) < p (3/ < ^ : i^f'^) ^Yi)+F (M„ > 2Z„, e<Ym< e"') 

<F{Mm> 2Zm, e<Y„,< e'^) + e/4, (A.5.5) 

Next, we use that 

lim P (Mm > 2Zrn, e<Yrn< e^M = 0, (A.5.6) 

since Yi = [t — 2)' log 2^; converges a.s., so that when Ym > £ and m is large, Mm~i is much smaller 
than Zm, so that = ^m-i + -2^m > 22^m has small probability, as m is large. 
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Then we use (|A.5.1|I - ()A.5.6|) . together with (|A.3.12|) . to derive that 

e<Yi-^<e-\ max |y,(-) - > ^3 ] (A.5.7) 



< P e < y^^' < e-\ max - Yj^^\ > e^, Y^^^' = y^, V/ < m + P (3/ < m : y/^' / y,) 

< 5^ p n F^,, n {fc G Ti-\s)} n{e< y^' < e"^} n {y/"' = y,, v; < m}) + 1 

A;>m 

< E ^ (^-.^(^) n {l^r - n'-ll - 2 + +e< 3e/2, 

k>m 

by Corollary IX42I □ 

Proof of Proposition |ni21(b). We first show (|3.8|) . then (|3.7jl . Due to Proposition l'6.'2( a). and 

using that {Y^^^ < s^^}, we find 

Y^-^<Yi-^+e'<Yi-\l + e'), 
apart from an event with probability Ojv,m,e 

(1), for ah k e T^^'K By (fXAQ]) and because k e rr[f\ 

this is equivalent to 

We next show (|3.7() . Observe that A; G T^'^' implies that either k — 1 £ Tm \ or k — I = m. 
Hence, from k G TJ^^ and Proposition I3.2r a^. we obtain, apart from an event with probability 

(1), 

for e > sufficiently small, and 

yr = yr - + - yn + yn > y^-i - > - 4e^)> (A.5.9) 

By ()A.3.9|) this is equivalent to 

V fc — 1 / Pc ' — " 1 ' 

when e > is so small that k(1 — 4e^) > 1, since r S (2, 3), and k = (r — 2)~^. □ 
Proof of Proposition 13. 3L We must show that 

P(A; G dT^''\e),e < y^' < e-\ Zll\ < N^) = o^,™,e(l), (A.5.10) 

where 

{k e dTD = {ke rr'} n{k + i^ T^-'}. 

In the proof, we will make repeated use of Propositions l3.2l and l3.11 whose proofs are now complete. 
According to the definition of Fm,k{£) in (|A.3.12|1 . 

¥{{k e ar^^'(e)} n {e < < e-'} n F^^kier) (A.5.11) 
<P(e<ir'<^-\ max jy/-' - y^l > + P (e < < > 2Z^->) • 
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In turn Propositions a) and 13.11 as well as HA.5.5ffA..5.6|) imply that both probabilities on the 
right-hand side of ()A.5.12|) are Oe(l), as first N oo, and then m oo. Therefore, it suffices to 
show 

P({A; G dTi''He),e < < e-\ < iV^} n F^,fc(e)) 

= ¥{{k + 1 T^~'(e), Z^^^\ < N^} n = o^,^,,(l). (A.5.12) 

Let X = A^'^{^-i) , and define the event Iff^k = -f]v,fc(«) H lN,k{b) n lN,k{c) H lN,kid), where 

/^,fc(a) = {Mn<iV^^^}, (A.5.13) 
= {x<Zf'}, (A.5.14) 

/^,fc(c) = {Zf'<iVT#}, (A.5.15) 

We split 

P({A; + 1 ^ rr'(e), 4+1 < iV^} n (A.5.17) 

+ F{{k + 1 Tr)(e), < iV^} n FrnA^) n I^,,) . 
We claim that both probabilities are small, which would complete the proof. We start to show that 

¥{{k + 1 Ti^^e), Zl!l\ < N^i} n n 7^,^) = o^,rn,.(l). (A.5.18) 

Indeed, by Lemma IF!T] and (|3.8() . 

p({A: + 1 ^ 7;|,^n^)} n ^ > iv^} n n 7^,^) (A.5.19) 

< F{{k G Tr'(e)} n{e< Y^' < ^^"'} n {Zf G [iV^, iVT#]}) + o^,„,,(l) = o^,™,,(l), 



where u = (r — 1) ^. Therefore, we are left to deal with the case where Z^'^' < A^^-i . For this, and 

assuming Ij^ we can use Proposition IXO^ b) with x = iY^c^-D < Z^"^' by 7jv,fc(6), and / = A; + 1 
to obtain that, whp, 

> 4!LVi - ^ - iV^ = AT^i^ _ ATT^ > , (A.5.20) 

where we have used that when k G Tm \s) and 1^^^ > e, then we have k < clog log A^, for some 
c = c(t, e), and hence, for A large enough, 

(yt _)_ l)3(K-CT,7(x))^c^7(a;) < _j_ l)3K^c^7{a;) ^ jjE^/S^ 

This proves ()A.5.18|) . 

For the second probability on the right-hand side of HA.5.17|) it suffices to prove that 

F{{k + 1 Ti^\e)] n F^^kie) n 1%^^) = Oiv,™,.(l). (A.5.21) 

In order to prove HA.5.211) , we prove that HA.5.21() holds with 7^ ^ replaced by each one of the four 
events 7^ ^(a), . . . , 7^ ^(d). For the intersection with the event 7^ ^(a), we apply Lemma rA.4.ir a'). 
which states that Fm^ki^) H 7^ ^(a) is an empty set. 
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It follows from ((231) that if /c + 1 %i!'\e), then 

K^+^y^'^' > log N. (A.5.22) 

If Fm,k{£) holds then by definition ()A.3.12|) . and Corollary FA. 4.21 whp, 

> n'-i > - > - ^'). (A.5.23) 

Hence, if Fm,k{e) holds and /c + 1 T,^f '(e), then, by (IA.5.22|1 ~ (IA.5.23|1 . whp, 

«;log(Zf >) = K^+^Yjf^ > (1 - e2)Kfc+iyW > logiV, (A.5.24) 

so that, whp, 

Z\^^ >x = N^^, (A.5.25) 
for small enough e > and sufficiently large N, i.e., we have 

F{{k + 1 T^^'(e)} n F^,fe(e) n = 0^.,™,e(l). 

From Proposition ESfb) it is immediate that 

P({A; + 1 Ti^\e)} n F^,fe(e) n 4^,(c)) = o^,^,,(l). 

Finally, recall that -^^.'^+1 is the number of constructed free stubs at level k + 1 after pairing of the 
first X stubs at level k. After the pairing of the remaining Z^^^ — x stubs at level k they can 'eat 
up' at most Z^^' — x < Zj!^^ stubs, so that lN,k{d) holds with probability 1, and hence the event 
in (|A.5.21() intersected with I'^ ^^{d) has probability equal to 0. 

This completes the proof of (|A.5.21|) and hence the Proposition. □ 
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